DISTRIBUTED  SENSOR  NETS 


Sponsored  By: 

Information  Processing  Techniques  Office 
Defense  Advanced  Research  Projects  Agency 


Hosted  by: 
Carnegie-Mellon  University 
Pittsburgh,  Pennsylvania 
December,  1978 


or 

ELEC i  f.fi 

f 

JUN  2  9  1984  ' 


This  document  has  bean  approved 
i  for  public  release  and  sale;  its 
dt  tribution  i3  unlimited. 


84  06  28  117 


THIS  DOCUMENT  IS  BEST 
QUALITY  AVAILABLE.  THE  COPY 
FURNISHED  TO  DTIC  CONTAINED 
A  SIGNIFICANT  NUMBER  OF 
PAGES  WHICH  DO  NOT 
REPRODUCE  LEGIBLY. 


COMPONENT  PART  NOTICE 


This  paper  is  a  COMPONENT  PART  of  the  following  COMPILATION  report: 


(SOURCE): 


To  order  the  complete  COMPILATION  report  use  ad-aim  sai _ . 

The  COMPONENT  PART  is  provided  here  to  allow  users  access  to  individually 

AUTHORED  SECTIONS  OF  PROCEEDINGS/  ANNALS/  SYMPOSIA/  ETC*  HOWEVER/  THE 
COMPONENT  SHOULD  BE  CONSIDERED  WITHIN  THE  context  OF  THE  OVERALL  COMPILATION 
REPORT  AND  NOT  AS  A  STAND-ALONE  TECHNICAL  REPORT* 

The  following  COMPONENT  PART  numbers  comprise  the  COMPILATION  report: 

AD#:  TITLE: 

P003  785  Position  Location  in  a  TIMA.  Network 

P003  786  What  Language  Understanding  Research  Suggests  About  Distributed 

Artificial  Intelligence 

POO3  787  Applications  of  the  Contract  Net  Framework:  Distributed  Sensing 

P003  788  Functionally  Accurate  distributed  Problem  Solving  Systems 

POO3  789  Distributed  Intelligence  for  Situation  Assessment 

POO3  790  DSN  (Distributed  Sensors  Networks)  Problems  —  An  Overview 

P003  791  Strawman  Design  of  a  DSN  (Distributed  Sensors  Networks)  to  Detect 

and  Track  L0w  Flying  Aircraft 

POO3  792  Distributed  Sensors  Networks  (DSN):  An  Attempt  to  Define  the 

Issues 

POO3  793  Multisite  Acoustic  Location 

POO3  79U  Long  Range  Acoustic  Tracking  of  Low  Flying  Aircraft 

POO3  795  High-Level  Protocols 

POO3  796  Tutorial  Survey  of  Algorithms  for  Locating  and  Identifying 

Spatially  Distributed  Sources  and  Receivers 
POO3  797  Single-Site  Detection  and  Target  Parameter  Estimation 

P003  798  Dynamically  Modifiable  Distributed  Systems 

POO3  799  Application  of  Knowledge  Based  Programming  to  Signal  Understanding 

Systems 

POO3  800  Machine  Recognition  and  Understanding  of  Manual  Morse 

P003  801  The  Positioning  Problem  -  A  Draft  of  an  Intermediate  Sunmaiy 

P003  802  The  Development  of  a  Multi-Sensor  System  for  Assessing  a  Threat 

— r— | — - Cyder  of  Battle 


This  document  hat  been  approved 
lot  public  release  ami  tale;  its 
distribution  it  unlimited. _ 


s 


DTIC 

ELEC  i  £» 


DISTRIBUTED 
SENSOR  NETS 


Proceedings  of  a  Workshop 

held  at 

Camegie-Mellon  University 
December  7-8,  1978 


Sponsored  by  the 
Defense  Advanced  Research  Projects  Agency 


A  limited  number  of  copies 
are  available  f rom  the 
Computer  Science  Department 
Carnegie-Meilon  University 


JUN  2  91984 


Pittsbur(h,  PA  1S213 


The  views  and  conclusions  contained  in  this  document  are  those  of  the  authors  and  should 
not  be  interpreted  as  necessarily  representing  the  official  policies,  either  expressed  or 
implied  of  the  Defense  Advanced  Research  Agency  or  the  United  States  Government. 


TABLE  OF  CONTENTS 


t S' 


SESSION  I  -  SYSTEM 

DSN  Problems  —  An  Overvieu 
Jeffrey  A.  Barnett 

USC/ Informat  ion  Sciences  Institute  37 


QFftftftlZATIQN  j 


Strauman  Design  for  a  OSN  to  Oetect  and  Track  Lou  Flying  Aircraft 
R.  Lacoss  and  R.  Ualton 
MIT/Lincoln  Laboratory 

Distributed  Sensors  Wetuorks  (OSN):  An  Attempt  to  Define  the  Issues 
Yechiam  Yemini 

USC/ lnformat ion  Sciences  Institute 


41 


53- 


Comments  on  Lou  Altitude  Air  Defense  Systems 
John  Fielding 

MIT/Lincoln  Laboratory  152 


SES21QM  II  z  SENSOR 

Sensor  Tutorial 
R.  Lacoss  _ 

-"MIT/Lincoln  Laboratory  SI 

Multi  site  Acoustic  Location 
Robert  Ualton 

MIT/Lincoln  Laboratory  63 

Long  Range  Acoustic  Tracking  of  Lou  Flying  Aircraft 
T.  E.  Landers  and  R.  T.  Lacoss 

MIT/Lincoln  Laboratory  68 


ISCHMOLflfiY. 


7 


-  COMMUNICATION 

High-Level  Protocols 
Robert  F.  Sproul  I  and  Dan  Cohen 

Carneg  i  e-Me  I  I  on  University  4  USC/ Informat  ion  Sciences  Institute 

Protocols  for  OSN  (Abetfact  only) 

Oanny  Cohen 

USC/I nf orma t i on  Sciences  Institute 


TECHfp 


LQfil 


y 


rv*- 


78 


124 


i. 


^SESSJflN  z  eSQ££2SIN£  techniques 

Tutorial  Survey  of  Algorithms  for  Locating  and  Identifying  Spatially 
Distributed  Sources  and  Receivers 
N.  Morf,  B.  Fried  lander  and  J.  Newkirk 

Stanford  University  94 

Single-Site  Detection  and  Target  Parameter  Estimation 
Paul  Oemko,  Jr. 

ttlT/Lincoln  Laboratory  105 

Position  Location  in  a  TDflA  Network 
Stephen  Cable 

Rockuel l  Internet ional  1 

The  Positioning  Problem  -  A  Draft  of  an  Intermediate  Summary 
Yechiam  Yemini 

USC/ 1 n forma t i on  Sciences  Institute  137 


AND  ALGORITHMS  • 


SESSION  V  -  DISTRIBUTED  SOFTWARE  ‘ 

Dynamically  Nodi f iabl e  Oistr ibuted  Systems 
A.  N.  Haber mann 
Carnegie-Nel Ion  University 

Language  Design  for  Distributed  Systems 
Peter  Hibbard 
Carnegie-Nel Ion  University 


-^SSION^I  i  y lEnk  RESEARCH  y  fan 


What  Language  Understanding  Research  Suggests  About  Distributed  AI 
Earl  0.  Sacerdoti 
SRI  International 

The  Development  of  a  Nulti-Sensor  System  for  Assessing  a  Threat 
Order  of  Battle 

Thomas  0.  Garvey  and  Nartin  A.  Fischler 
SRI  International 


111 

1S1 


8 


IAS 


Application  of  Knowledge  Based  Programming  to  Signal  Understanding  Systems 
Cor del  I  Green  and  Brian  P.  NcCune 

Systems  Control,  Inc.  115 


li 


I 


tv*  *» 


Stirve  i  1 1  since  Integration  Automation  Project  (SlAP) 
Robert  J.  Orazovich  and  Scottie  Brooks 
Systems  Control,  Inc. 

Machine  Recognition  and  Understanding  of  Manual  Morse 
Albert  Vezza  et.  al. 

MIT/Lincoln  Laboratory 


r  •  •  *  __  r  » 

SESSION  V]I  -  DISTRIBUTED  AI  1 

Applications  of  the  Contract  Net  Framework:  Distributed  Sensing 
Reid  G.  Smith  and  Randal  I  Oavis 
Stanford  University 

Functionally  Accurate  Distributed  Problem  Solving  Systems 
Victor  R.  Lesser  and  Daniel  0.  Cork i 1 1 
University  of  Massachusetts  at  Amherst 

Distributed  Intelligence  for  Situation  Assessment 
F.  Hayes-Roth  and  R.  Uesson 
The  Rand  Corporation 


h*.  r*  r*r  "w  <•  *  .*  v  •*  .  ■  « s  v.” ./  ■_■  «w'  «,_*■  ^ *.  »„**  •*.’  %■  ^ ■ 

•  '•  y 

.‘«V\ 

* . 

AUTHOR  INDEX 

■  •  fc  - 

• 

*.’/*\* 

rm 

r  « 

*  »  •# 

Barnett,  Jeffrey  A. 

37 

Brooks,  Scottie 

119 

v\- 

Cable,  Stephen 

1 

y/-. 

> 

Cohen,  Oan 

78,  12a 

-■ . 

r.m 

’*  V* 

Corkill,  Daniel  D. 

21 

•“  •/.  V 

»/  *, 

12 

V‘v 

.  ■ 

Davis,  Randall 

me 

Demko  Jr.,  Paul 

105 

Drazovich,  Robert  1 

119 

*.  /  "  * 

152 

v-; 

Fielding,  John 

v  * 

i 

Fischler,  Martin  A. 

Friedlander,  B. 

146 

94 

ieei 

y-:>: 

* .. 

K. 

| 

Garvey,  Thomas 

146 

Green,  Cordell 

115 

» 

i 

Habermann,  A.  N. 

111 

Hibbard,  Peter 

151 

■Xj 

-•.’w 

Hayes-Roth,  F. 

27 

. ' .  • 

s 

Lacoss,  R. 

41,  61,  68 

•„  ’  ■ 

-•  -Jt 

AN 

Landers,  T.  E. 

68 

■  ;-v 

Lesser,  Victor  R. 

21 

v'.\ 

McCune,  Brian  P. 

115 

Sv-i 

94 

Morf,  M. 

r - « 

’  > 

Newkirk,  1 

94 

' "  ,* 

Sacerdoti,  Earl  D. 

8 

v.,-0 

Smith,  Reid  G. 

12 

Sproull,  Robert  F. 

78 

yy/ 

Vezza,  Albert 

125 

■ 

Walton,  R. 

41,  63 

■»  •\  * 

* 

Wesson,  R. 

27 

Yemini,  Yechiam 

53.  137 

V.V,* 

£ 

*  *  *  *• 

*:• 

,  lv 

@ 

m 

T 

•  * '  *  * 

vj 

s 

T 


V  K'\  fy  .  M  ■  -V 


l. 


AD-P003  785 


Position  Location  in  a  TDMA  Network 
Stephen  Cable 
Rockwell  International 


Introduction 

Many  of  the  communications  systems  being  developed  to¬ 
day  for  the  tactical  environment  use  some  form  of  TOMA 
in  order  to  share  the  communications  resource  among 
the  network  elements,  in  addition,  a  time  dependent 
waveform  (eg,  JTIDS,  Packet  Radio,  etc.)  is  often  utilized 
to  increase  jam  resistance.  These  systems  require  time 
synchronization  within  the  network  in  order  to  be  able 
to  successfully  communicate.  The  timing  measurements 
necessary  to  provide  network  synchronization  can  be  ex¬ 
tended  to  provide  accurate  range  measurements  between 
network  elements.  These  range  measurements  can  be 
used  to  calculate  the  positions  of  ail  network  elements 
relative  to  some  grid  which  has  been  established  by  the 
network.  The  tactical  value  of  a  relative  position  loca¬ 
tion  function  is  well  established  for  providing  relative 
nagivation.  targeting,  resource  mapping  and  allocation, 
routing  and  low  probability  of  intercept  information. 

The  availability  of  range  measurements,  which  could  be 
used  for  position  location,  and  the  value  of  position  loca¬ 
tion  as  a  tactical  tool  motivated  the  development  of  the 
position  location  methods  described  in  this  paper. 

The  method  for  providing  a  relative  position  location 
function  is  summarized  below.  As  a  part  of  the  network 
initialization  process,  a  relative  grid  is  established  for 
the  network.  Each  element  in  the  network  will  be  re¬ 
sponsible  for  tracking  its  own  position  relative  to  that 
grid.  Accurate  range  measurements  are  obtained  by  ac¬ 
curately  controlling  the  time  of  transmission  (TOT)  and 
by  accurately  measuring  the  time  of  arrival  (TOA)  of 
messages.  The  propagation  delay  measurement  obtained 
from  the  TOT  and  the  TOA  is  multiplied  by  the  speed 
of  propagation  to  obtain  a  range  measurement.  If  the 
transmitted  message  contains  the  position  of  the  trans¬ 
mitter,  the  range  measurement  can  be  used  along  with 
similar  range  measurements  from  other  sources  to  cal¬ 
culate  an  estimate  of  the  receiver's  position  and  the  cur¬ 
rent  network  reference  time.  Periodically,  these  esti¬ 
mates  are  used  by  a  Kalman  filter  linear  estimator  to 
develop  and  correct  a  state  model  of  the  dynamics  of 
the  unit  and  its  interna]  time  bese.  The  rate  at  which 
these  models  must  be  updated  is  a  function  of  the  mobility 
of  the  platform  and  its  position  location  accuracy  require¬ 
ments.  By  using  these  state  models,  each  unit  can  track 
its  estimated  position  and  an  estimate  of  the  network 
reference  time.  This  information  is  then  made  available 
to  other  units  for  use  in  their  position  location  algorithms. 
A  hierarchical  structure  is  used  to  determine  which  po¬ 
tential  sources  of  position  information  should  be  utilized 
by  a  particular  user.  This  structure  prevents  the  feedback 
of  errors  which  can  result  in  an  unstable  condition.  The 
remainder  of  the  paper  will  provide  an  overview  of  the 
major  concepts  introduced  above,  will  discuss  some  of 
the  factors  which  determine  accuracy,  and  will  consider 
the  issues  associated  with  actually  implementing  a  position 
location  function. 


Multilateration 

Multiiateration  is  the  use  of  range  measurements  from 
multiple  sources  to  determine  position  relative  to  the 
locations  of  the  sources.  Multilateration  is  suited  for 
use  in  a  tactical  TOMA  network  for  a  variety  of  reasons, 
including: 

a.  It  utilizes  available  propagation  delay  measurements 

b.  Position  estimates  can  be  calculated  by  each  element 

c.  It  does  not  require  fixed  references 

d.  It  does  not  rely  upon  critical  nodes. 

Given  the  TOT  and  TOA  of  the  message,  an  estimate 
of  the  line  of  sight  range  between  the  transmitter  and 
receiver  can  be  computed.  If  the  received  message  con¬ 
tains  the  absolute  or  relative  position  of  the  source  (trans¬ 
mitter),  the  receiver  can  use  this  information,  in  conjunction 
with  the  measured  range  and  similar  measurements  from 
other  sources,  to  estimate  its  position.  Each  range  mea¬ 
surement  defines  a  circle  of  radius  R  (where  R  =  measured 
range)  with  its  center  at  the  source.  According  to  the 
measurement,  the  receiver  lies  some  where  on  the  ciretle. 

In  the  horizontal  case  (ie,  altitude  assumed  known)  per¬ 
fect  range  measurements  from  three  noncolinear  sources 
will  generate  three  circles  with  a  unique  common  inter¬ 
section  which  is  the  position  of  the  receiver.  However, 
error-corrupted  measurements  will  produce  a  set  of  circles 
with  no  common  intersection  or  an  incorrect  common  inter¬ 
section.  This  result  is  illustrated  in  figure  1  '  .  four 
sources  and  one  receiver.  Given  cl  corrupted 

range  measurements,  the  receiver  «d  to  calculate 

a  position  estimate  which  is  in  some  ^e  optimum. 
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Figure  1.  Multilateration  Measurements  for  Position 
Location. 

By  linearizing  the  problem  using  a  Taylor's  series  expan¬ 
sion  and  employing  a  weighted  least  square  error  criterion, 
this  problem  can  be  efficiently  solved  by  »  microprocessor. 
This  method  requires  an  initial  position  estimate  provided 
by  previous  measurements  or  by  an  initialization  algorithm 
which  is  beyond  the  scope  of  this  paper.  The  result  of 
the  linearization  can  be  observed  in  figure  2a.  The  ith 


range  measurement  now  constrains  the  solution  to  lie 
on  the  straight  line  which  is  tangent  to  the  circle  of  radius 
R(i)  at  the  point  where  the  line  connecting  the  ith  source 
and  the  estimated  position  intersects  the  circle.  The 
weighted  .east  square  error  criterion  is  then  used  to  solve 
for  an  estimate.  The  weights  take  into  account  the  rela¬ 
tive  accuracy  of  the  position  and  time  estimates  of  the 
sources.  The  variance  of  the  error  in  the  estimate  is 
also  calculated  based  on  the  estimated  variance  of  each 
measurement.  The  final  estimate  derived  from  the  mea¬ 
surements  of  figure  1  is  illustrated  in  figure  2b. 
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Figure  3.  Passive  Mode  Observation. 

Upon  reception  of  similar  measurements  from  at  least 
three  nonoolinear  sources,  equation  (1)  can  be  linearized 
and  an  estimate  of  the  clock  offset  and  the  error  in  the 
unit’s  previous  position  estimate  can  be  obtained. 
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Figure  2.  Linearized  Solution  of  Multilateration 
Problem. 

Thus,  a  network  element  provided  with  three  range  mea¬ 
surements  of  adequate  quality  can  compute  an  estimate 
of  its  relative  position. 


In  the  active  mode  of  operation,  a  unit  must  interrogate 
the  sources  from  which  it  desires  to  obtain  position  and 
time  information.  Upon  reception  of  this  interrogation, 
the  interrogated  unit  replies  with  a  position  report  con¬ 
taining  the  TOA  of  the  interrogation  and  the  position 
of  the  source  at  the  time  of  the  reply.  The  form  of  the 
active  mode  range  measurement  is  illustrated  in  figure  4. 
The  two  transmissions  of  the  active  mode  provide  an  in¬ 
dependent  time  measurement,  Dt,  and  an  independent 
position  error  measurement.  Dp.  From  figure  4.  Dt  and 
Dp  can  be  written  as 


Range  Measurements 

The  accuracy  and  quantity  of  range  measurements  avail¬ 
able  to  a  unit  is  an  important  factor  in  the  accuracy  of 
the  position  location  function.  On  the  other  hand,  the 
portion  of  the  network  capacity  required  to  support  posi¬ 
tion  location  messages  should  be  minimized.  Thus,  a  range 
measurement  technique  which  fulfills  both  of  these  re¬ 
quirements  is  desirable.  There  are  two  basic  methods 
(modes)  of  obtaining  a  range  measurement  for  position 
location  and  a  clock  offset  measurement  for  time  synch¬ 
ronization.  One  method,  a  passive  mode  measurement, 
requires  only  the  reception  of  a  position  report  from  the 
source.  The  second  method,  an  active  mode  measurement, 
requires  an  interrogation  of  the  source  of  position  informa¬ 
tion  and  a  reply  containing  the  information  needed  to 
calculate  an  active  range  measurement.  A  comparison 
of  these  two  modes  reveals  that  the  selection  of  the  mea¬ 
surement  mode  is  dependent  upon  the  operational  require¬ 
ments  aad  the  network  environment  of  the  unit. 

In  the  passive  mode  of  operation,  the  unit  estimates  its 
position  and  clock  offset  utilizing  passively  received  posi¬ 
tion  reports.  The  form  of  the  passive  mode  measurement 
is  illustrated  in  figure  3.  By  including  terms  to  indicate 
the  sources  of  error,  the  equation  for  the  measurement,  D, 
in  figure  3  can  be  rewritten  as 


Dt 

=  4tj  +  (m2  -  ml)/2 

(2) 

Dp 

=  ER  *  (m2  -  ml)/2  *  n 

(3) 

where 

mi 

=  ith  TOA  measurement  error 

n  =  errors  introduced  by  incorrect  source  position 
estimate. 

These  equations  show  that  clock  offset  and  position  errors 
have  been  decoupled  in  the  active  mode. 
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where 

n  *  errors  introduced  by  incorrect  source  position 
estimates 
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Figure  4.  Active  (RTT)  Observation. 


Analysis  and  simulation  results  indicate  that  the  active 
mode  range  measurements  produce  better  performance 
that  is  less  dependent  upon  the  geometry  of  the  sources 
than  for  the  passive  mode  case.  However,  there  are  rea¬ 
sons  for  choosing  to  use  the  passive  mode  at  the  expense 
of  degraded  performance.  In  an  operational  scenario. 
some  units  may  be  required  to  operate  in  a  radio  silent 
mode.  At  the  same  time,  it  may  be  important  for  these 
units  to  include  position  information  in  their  infrequent 
transmissions.  Such  units  would  certainly  operate  in  the 
passive  mode.  Another  important  consideration  is  the 
amount  of  network  traffic  required  for  position  location. 

If  network  capacity  is  a  problem  and  an  adequate  source 
geometry  is  available,  then  assigning  some  units  to  the 
passive  mode  can  reduce  the  traffic  requirements.  An¬ 
other  solution  to  this  problem  is  the  use  of  a  hybrid  mode 
employing  active  time  measurements  with  passive  position 
measurements.  A  discussion  of  the  hybrid  mode  is  beyond 
the  scope  of  this  paper.  In  an  attempt  to  satisfy  accuracy 
requirements,  capacity  limitations  and  operational  objec¬ 
tives.  a  typical  network  may  include  active  mode  units, 
passive  mode  units,  and  hybrid  mode  units. 

Kalman  Filter 

In  order  to  successfully  track  the  clock  and  the  position 
of  a  unit  using  noisy  measurements  and  to  extrapolate 
time  and  position  between  sets  of  measurements,  a  state 
model  of  the  clock  and  the  dynamics  of  the  unit  must 
be  maintained.  Thus,  once  position  and  time  estimates 
have  been  obtained  from  the  solution  to  the  linearized 
multilateration  equations,  this  new  estimate  must  be 
combined  with  the  previous  estimates  to  update  the  state 
model.  A  Kalman  filter  was  selected  to  track  the  state 
of  the  system.  A  Kalman  filter  provides  improved  per¬ 
formance  over  simpler  fixed  gain  filters  (eg  a  ,  -  3  tracker) 
and  it  provides  the  error  covariance  matrix  necessary 
to  implement  the  covariance  defined  hierarchy  described 
below.  The  Kalman  filter  is  defined  by  the  state  model 
description  of  the  system  and  the  error-corrupted  mea¬ 
surements  which  are  available  for  updating  the  model. 

For  a  ground,  mobile  network,  a  six-dimensional  state 


Update 

s  (new)  =  As  (old)  *  K  (new)  (  z  (new)  -  HA  s  (old)] 
K  (new)  *  C  (ext)  HT  [  H  C  (ext)  HT  *  R  (new)]'1 
C  (new)  =  C  (ext)  -  K  (new)  H  C  (ext) 
where 

s  =  state  vector 

A  =  state  transition  matrix 

C  =  state  vector  error  covariance  matrix 

K  =  Kalman  filter  gain  matrix 

z  -  measured  values 

H  =  linear  transformation  from  s  to  z 

R  =  measurement  error  covariance  matrix. 

These  Kalman  filter  equations  and  the  multilateration 
measurements  provide  the  basis  for  tracking  the  relative 
position  of  network  elements. 

Network  Architecture 

The  ability  of  each  network  element  to  perform  position 
location  is  dependent  upon  the  existence  of  a  common 
relative  grid  and  the  reception  of  an  adequate  number 
of  useable  position  reports.  The  network  structure  of 
the  position  location  function  should  perform  four  major 
tasks.  Those  tasks  are: 

a.  Support  time  synchronization  in  the  network 

b.  Establish  a  rectilinear  coordinate  grid  for  relative 
navigation 


vector  has  been  demonstrated  to  provide  an  adequate 
state  model.  The  six  components  are  clock  offset,  clock 
drift  rate,  x-direction  position,  y-direction  position,  x- 
direction  velocity,  and  y-direction  velocity.  This  model 
assumes  that  altitude  is  provided  from  an  external  mea¬ 
surement.  independently  from  the  position  location  func¬ 
tion.  A  variant  of  the  above  state  vector  would  use 
heading  and  velocity  in  place  of  velocity  in  the  x  and  y 
directions.  The  measurement  used  by  the  Kalman  filter 
would  be  the  solution  to  the  linearized  multilateration 
equations  and  its  associated  covariance  matrix.  Platforms 
exhibiting  higher  mobility,  eg  aircraft,  would  require  an 
extended  state  vector  and  additional  measurements  from 
onboard  navigation  equipment  in  order  to  successfully 
track  the  platform. 

Given  a  measurement  and  its  variance,  a  Kalman  filter 
attempts  to  update  the  state  model  in  a  way  such  that  the 
trace  of  the  covariance  matrix  of  the  state  vector  is  mini¬ 
mized.  The  oasic  Kalman  filter  equations  are  summarized 
below: 

Extrapolation 

s(ext)  *  As  (old) 

C  (ext)  *  A  C  (old)AT 


c.  Ensure  an  adequate  supply  of  position  reports  to  sup¬ 
port  any  unit  attempting  to  perform  position  location 

d.  Provide  for  stability  and  robustness  of  the  position 
location  function. 

Due  to  the  interrelationship  between  time  synchronization 
and  position  location,  any  network  structure  which  per¬ 
forms  the  last  three  tasks  will  support  time  synchroniza¬ 
tion  in  the  network.  For  applications  where  relative 
navigation  is  not  required,  the  position  location  algorithms 
can  provide  time  tracking  only.  The  clock  offset  measure¬ 
ments  can  be  provided  by  active  mode  measurements 
without  position  or  by  passive  mode  measurements  with 
a  rough  position  estimate. 

In  conjunction  with  the  network  initialization  procedure, 
the  position  location  network  structure  should  define 
a  relative  rectilinear  grid  and.  if  possible,  a  geodetic 
grid.  Some  of  the  network  elements  are  given  the  re¬ 
sponsibility  for  establishing  the  grid.  These  elements 
are: 

a.  Master  Unit  (MU).  An  element  whose  relative  posi¬ 
tion  is  defined  to  be  the  origin  of  the  grid. 


t>.  Relative  Position  Reference  (RPR).  An  element 
which  knows  its  position  relative  to  the  MU  and  one 
other  RPR.  These  are  at  least  two  RPR's  in  the  net¬ 
work.  The  ranges  between  the  MU  and  the  RPR’s 
may  be  obtained  through  round-trip  timing  measure¬ 
ments.  but  some  prior  information  must  be  available 
to  obtain  an  unambiguous  estimate  of  their  relative 
positions. 

c.  Absolute  Position  Reference  (APR).  An  element 

with  an  accurate  estimate  of  its  geodetic  coordinates 
obtained  from  some  source  other  than  the  position 
location  function. 

During  network  initialization,  the  MU  and  RPR’s  define 
a  relative  rectilinear  grid.  Given  two  APR’s,  the  relative 
grid  can  be  mapped  onto  a  geodetic  grid  through  a  co¬ 
ordinate  transformation  algorithm.  Once  the  grid  is  es¬ 
tablished,  the  loss  of  an  MU.  RPR.  or  APR  does  not  disable 
the  position  location  capability  of  the  network.  An  alter¬ 
nate  master  unit  (AMU)  is  designated  to  assume  the  role 
of  the  MU  upon  loss  of  the  original  master  unit.  This 
feature  enhances  the  survivability  of  the  position  location 
function  in  the  network. 

The  covariance  defined  hierarchy  is  used  to  select  the 
useable  sources  of  position  information  from  the  set  of 
all  elements  within  line  of  sight  (LOS)  of  a  particular 
element.  Each  element  includes  a  quantized  estimate 
of  the  error  variances  of  its  position  and  time  estimates 
in  its  position  report.  These  variances  are  provided  by 
the  Kalman  filter  of  each  element.  The  position  report 
also  contains  information  indicating  the  number  of  relays 
between  that  element  and  the  MU.  An  element  selects 
the  sources  with  the  most  accurate  position  information 
which  are  closer  to  the  MU  than  it.  In  this  manner,  posi¬ 
tive  feedback  of  errors  is  eliminated. 


ai  accuracy  requirements  and  the  mooility  of  the 
platform,  the  two  basic  sources  of  error,  which  are 
a  function  of  the  update  rate,  are  errors  introduced 
by  state  extrapolation  and  errors  caused  by  combining 
position  reports  received  at  different  times  into  one 
measurement.  These  errors  increase  as  the  mobility 
of  the  platform  increases  and  decrease  as  the  update 
rate  increases.  A  highly  mobile  platform  exhibits 
significant  acceleration  and  higher  order  derivatives 
of  motion.  These  unmodeled  states  can  cause  diver¬ 
gence  of  the  estimate  unless  they  are  compensated 
for  by  age  weighting  or  similar  techniques.  An  anal¬ 
ysis  of  these  errors  was  performed  using  the  Kalman 
filter  error  sensitivity  analysis.  From  these  results 
and  given  the  accuracy  requirement,  the  expected 
dynamics  of  the  platform,  and  the  conditions  (source 
error,  GDOP,  etc.)  under  which  the  accuracy  is  re¬ 
quired.  a  minimum  update  rate  can  be  selected.  This 
interaction  is  illustrated  in  figure  5.  where  maximum 
update  periods  were  selected  for  various  platforms 
and  performance  goals. 
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Figure  5.  An  Example  of  Update  Rate  vs.  Platform 
Goals. 


nation  Delay  Measurement 


Performance  Considerations 

The  algorithm  and  network  architecture  introduced  above 
were  the  subject  of  an  in-depth  analysis  to  determine 
the  accuracy  of  the  system  and  its  impact  upon  network 
operation,  this  effort  utilized  an  analysis  of  the  state 
model,  a  Kalman  filter  error  sensitivity  analysis  program 
and  a  Monte  Carlo  simulation  to  characterize  the  effects 
of  various  factors  upon  performance.  It  was  found  that 
the  performance  of  the  position  location  function  is  a 
complex  function  of  the  update  rate,  the  mobility  of  the 
platform,  the  TOA  measurement  accuracy,  the  propaga¬ 
tion  effects,  the  quality  and  topology  of  the  sources  used, 
and  several  other  factors.  By  characterizing  the  indivi¬ 
dual  and  cumulative  effects  of  these  factors,  required 
values  for  parameters,  such  as  required  update  rate,  re¬ 
quired  TOA  estimator  accuracy,  etc.,  which  will  provide 
adequate  accuracy  for  a  given  operational  scenario  can 
be  determined.  The  impact  of  three  of  the  performance 
factors  is  summarized  below. 


The  sources  of  error  in  the  propagation  delay  mea¬ 
surement  include  TOT  errors,  propagation  effects, 
and  TOA  errors.  The  sum  of  all  these  errors  can 
be  lumped  together  and  called  the  TO  A  measurement 
error.  The  impact  of  TOA  measurement  errors  on 
position  accuracy  is  a  function  of  the  mode  of  the 
unit  (active  or  passive),  the  number  of  sources  used, 
the  dynamics  of  the  unit,  and  the  state  mooel  used 
by  the  Kalman  filter.  The  effect  of  TOA  uncertainty 
was  characterized  under  venous  conditions  using 
the  analysis  tools  mentioned  earlier.  The  uncertainty 
in  the  TOA  measurement  increases  the  uncertainty 
of  the  measurement  supplied  to  the  Kalman  filter. 
This  increases  the  response  time  of  the  filter,  making 
it  more  difficult  to  track  mobile  platforms.  The  anal¬ 
ysis  indicates  that  a  TOA  of  one  sigma  uncertainty 
of  100  ns  may  be  acceptable  for  a  slowly  moving 
platform  (eg,  manpack).  but  the  accuracy  must  in¬ 
crease  as  the  mobility  increases  in  order  to  maintain 
the  same  accuracy  and  update  rate. 


Update  Rate- Platform  Dynamics 


Network  Top 


A  major  impact  of  the  position  location  function 
upon  a  network  is  the  portion  of  the  communications 
capacity  of  the  network  which  must  be  allocated 
to  position  location.  Ideally,  the  minimum  number 
of  position  reports  necessary  to  meet  the  performance 
goal  should  be  used.  This  implies  that  the  update  rate 
of  each  unit  should  be  as  low  as  possible.  The  major 
factors  determining  the  update  rate  are  the  operation- 


The  topology  of  the  network  has  a  significant  impact 
on  the  position  location  accuracy  of  the  units  in  the 
network.  A  number  of  different  network  properties 
determine  the  configuration  and  quality  of  sources 
available  to  an  individual  unit.  Some  of  these  proper¬ 
ties  are  listed  in  table  1. 


Table  1.  Network  Properties  and  Source  Quality. 


PROPERTY 


Covariance  defined  hierarchy 
Number  of  units  in  net 
Area  of  coverage 
Location  of  MU,  RPR  and  APR 
Network  dynamics 
Network  terrain 


The  geometric  dilution  of  precision  (GDOP)  is  a  mea¬ 
sure  of  the  errors  introduced  by  the  geometry  of 
the  sources.  It  is  usually  defined  as 

'"TT  ”  "  (4) 

v  jx  ay  am 


=  variance  of  the  solution  for  position  in  the 
x-direction 

=  variance  of  the  solution  for  position  in  the 
y-direction 

=  variance  of  the  measurement  error  from 


As  GDOP  increases,  the  accuracy  of  a  measurement 
decreases.  GDOP  is  a  function  of  the  method  of 
range  measurement.  The  correlation  between  time 
and  position  errors  in  the  passive  mode  accounts 


COMMENTS _ 

Determines  quality  and  choice  of  sources 
Effects  number  pf  sources  available 
Effects  network  connectivity 
Effects  source  quality 
Effects  availability  and  quality  of  sources 
Effects  network  connectivity 


for  much  of  the  GDOP  present  in  that  mode.  As 
a  comparison  between  passive  and  active  mode  GDOP. 
consider  the  contours  of  equal  GDOP  plotted  in  fig¬ 
ure  6  and  figure  7.  Obviously,  for  the  source  con¬ 
figuration  of  the  figures,  the  active  mode  should 
provide  better  performance.  This  premise  is  borne 
out  by  the  simulation  results  recorded  in  figure  8 
for  a  unit  traveling  with  constant  velocity  along  the 
path  indicated  in  figure  S  and  figure  7. 

One  response  of  a  passive  mode  unit  with  poor  per¬ 
formance  due  to  GDOP  would  be  to  switch  to  active 
mode  operation.  However,  operational  objectives 
may  force  some  units  to  operate  in  the  passive  mode. 
In  that  case,  it  is  the  responsibility  of  the  network 
to  provide  adequate  sources  of  position  information. 

implementation  issues 

The  impact  of  the  position  location  function  can  be  identi¬ 
fied  in  three  areas.  These  areas  are:  (1)  network  through¬ 
put,  (2)  hardware,  and  (3)  software. 


Figure  6.  GDOP  Contours  for  Basic  Position  Location 
Element  (Passive). 


Figure  7.  GDOP  Contours  for  Baste  Position  Location 
Element  (Active). 
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Figure  9.  POSLOC  Bit  Rate  Example. 


Figure  8.  Simulation  Results. 


Throughput 


The  throughput  or  data  distribution  capability  of 
the  network  is  impacted  in  two  ways.  First,  some 
amount  of  overhead  traffic  is  introduced  into  the 
network  to  support  the  requirement  for  position  re¬ 
ports.  This  traffic  is  in  the  form  of  overhead  bits 
added  to  existing  messages  and  additional  messages 
created  solely  for  the  purpose  of  carrying  position 
information.  Second,  some  of  the  processing  time 
of  the  unit  must  be  used  to  process  position  informa¬ 
tion.  For  some  networks,  processing  delays  can  be¬ 
come  the  limiting  factor  on  throughput.  Consequently, 
the  amount  of  processing  required  for  POSLOC  could 
have  an  adverse  effect  on  network  throughput. 

In  figure  9,  the  average  bits/sec  of  position  informa¬ 
tion  received  by  an  average  unit  is  plotted  as  a  func¬ 
tion  of  the  number  of  units  within  LOS  of  that  unit. 
This  is  a  realistic  measure  of  network  loading  because 
the  position  reports  are  not  relayed  when  used  solely 
for  computing  position.  Several  plots  are  included 
in  figure  9,  representing  several  network  make-ups 
and  position  locaton  strategies.  The  important  point 
to  note  is  that  for  less  than  20  LOS  units,  all  of  the 
scenarios  require  less  than  1  kb/s  of  position  informa¬ 
tion. 

The  amount  of  processing  time  required  by  the  posi¬ 
tion  location  algorithm  can  be  estimated  by  deter¬ 
mining  the  time  required  for  the  floating  point 
operations.  An  estimate  was  done  for  a  16-bit  pro¬ 
cessor  with  a  hardware  multiply  and  divide.  With 
a  6-s  update  rate,  approximately  3  percent  of  the 
processors  capability  would  be  used  for  position  lo¬ 
cation.  Thus,  this  type  of  processor  should  be  able 
to  provide  position  location  without  seriously  increas¬ 
ing  node  delay. 


Hardware 

In  general,  the  hardware  would  be  impacted  in  the 
following  three  areas: 

a.  Accurate  TOA  estimator 

b.  Accurate  TOT  control 

c.  Increased  memory 

The  TOA  estimator  hardware  is  determined  by  the 
type  of  waveform  used.  In  some  systems,  a  delay 
lock  loop  is  used  to  track  the  TOA  of  the  incoming 
signal.  In  a  ground,  mobile  environment,  it  is  de- 
sireable  to  detect  the  leading  edge  of  the  received 
multipath  signal.  Detection  of  the  leading  edge  may 
require  coherent  averaging  of  the  signal  in  order 
to  increase  the  signal  to  noise  ratio  of  the  leading 
multipath  component.  The  TOT  can  be  controlled 
by  retiming  the  signal  immediately  before  modula¬ 
tion  of  the  IF.  A  survey  of  TOA  estimator  and  TOT 
control  techniques  indicates  that  the  TOA  estimator 
and  TOT  controller  add  to  the  complexity  of  the 
system,  but  their  complexity  is  not  unreasonable. 

An  algorithm  supporting  two  modes  of  operation 
and  coordinate  conversion  routines  could  be  stored 
in  less  than  8  k  16-bit  words.  Thus  the  hardware 
modifications  required  are  feasible. 

Software 

The  use  of  range  measurements  to  provide  time  syn¬ 
chronization  and  position  location  requires  a  signi¬ 
ficant  amount  of  software.  The  software  tasks  would 
include  the  following  functions: 

a.  Position  location  algorithms 

1.  Multilateration  solution 

2.  Kalman  filter 

b.  Interface  with  the  operating  system 

c.  Creation  and  transmission  of  position  reports 

d.  Protocols  for  position  reports. 


Conclusion 


/ 


v 

In  summary,  usable  position  accuracies  can  be  obtained 
within  a  TOMA  network.  A  position  location  algorithm 
utilizing  multilateration  and  a  Kalman  filter  linear  esti¬ 
mator  can  track  the  position  and  clock  offset  within  each 
network  element.  An  analysis  of  this  method  shows  that 
performance  is  a  complex  function  of  a  number  of  factors. 
With  the  tools  which  have  been  developed  for  analyzing 
performance,  the  obtainable  accuracy  can  be  predicted 
or  the  parameters  required  to  meet  a  performance  goal 
can  be  chosen,  given  a  particular  scenario.  The  cost  of 
implementing  position  location  in  terms  at  network 
throughput  and  hardware  complexity  should  not  be  un¬ 
reasonable.  Consequently,  position  location  may  be  e 
valuable,  obtainable  addition  to  a  tactical  data  distri¬ 
bution  network. 

\ 
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ABOUT  DISTRIBUTED  ARTIFICIAL  INTELLIGENCE 
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Natural  language  communication  requires  that 
dialogue  participants  know  not  only  the  structure 
of  language  but  also  something  about  the  subject 
natter  under  discussion  and  about  the  processes 
through  which  concepts  in  the  subject  area  say  be 
expressed  by  naans  of  words,  phrases,  and  sentences 
in  the  language.  Similarly,  a  rich  dialogue  among 
distributed  intelligent  processors  will  require 
that  the  processors  have  a  (partial)  knowledge  base 
in  cannon  and  that  they  employ  and  understand 
cannon  graundrules  about  how  to  convey  portions  of 
that  knowledge  base  within  the  available 
communication  protocols.  Therefore,  designers  o f 
distributed  artificial  intelligence  (DAI)  systens 
night  find  useful  concepts  in  the  designs  of 
artificial  intelligence  systens  for  natural 
language  understanding. 


Computer  science,  as  opposed  to  traditional 
mathematics,  views  computation  as  a  process 
performed  with  finite  resources  over  time.  In  an 


analogous  manner,  natural  language  understanding 
research  in  artificial  intelligence,  as  opposed  to 
traditional  linguistics,  is  increasingly  concerned 
with  communication  as  a  process  performed 

fefllla  .“Mflursaa  over  time.  DAI  systems  Of 
any  power  will  certainly  also  have  to  view 
Interprocessor  oommunicstion  as  a  process  performed 
with  finite. resources  over  time.  The  particular 
perspective  of  artificial  intelligence  on  the 
problem  of  understanding  natural  language  may  help 
us  to  articulate  some  issues  of  importance  that 
must  be  faced  in  developing  distributed  systems 
that  display  intelligent  behavior. 


*  The  preparation  of  this  paper  was  supported  by 
the  Advanced  Research  Projeots  Agency  of  the 
Department  of  Defense.  It  was  prepared  in  haste, 
for  which  I  apologize  to  the  reader.  I  wish  to 
thank  Barbara  Groaz  and  Peter  Hart  for  their 
helpful  suggestions.  Please  don't  expect  them  to 
stand  firmly  behind  the  papers'  conclusions, 
though,  ror  that  matter,  don't  expect  me  to! 


This  brief  note  will  attempt  to  draw  an 
extended  analogy  between  natural  language 

communication  and  interprocess  communication, 
thereby  hopefully  identifying  in  advance  some  of 
the  gaps  that  must  be  filled  as  the  processors  that 
are  communicating  become  increasingly  powerful. 

A.  Introduction 

Communications  specialists  have  generally 
concerned  themselves  with  the  format  and  content  of 
individual  classes  of  interprocess  messages,  rather 
than  on  tha  ongoing  Interaction  (spanning  many 
instances  of  many  classes  of  message)  through  which 
particular  information  is  communicated.  This  Is 
analogous  to  the  linguists'  traditional  concerns 
with  the  form  and  meaning  of  individual  words  and 
sentences.  In  contrast,  AI  researchers  on  language 
understanding  view  communication  aa  an  activity 
performed  by  two  or  more  cooperating  parties.  This 
viewpoint  on  language  understanding  systems 
suggests  that  a  crucial  issue  for  DAI  is  a  careful 
articulation  of  the  processes  that  underlie 
participation  in  dialogue  (as  opposed  to  the 
structuring  of  the  individual  interprocess  messages 
and  communication  protocols). 

Below  we  will  characterize  three  kinds  of 
knowledge  that  language  understanding  research 
suggests  must  be  available  to  a  communicating 
process.  Than  we  will  discuss  the  activities 
involved  in  interpreting  and  responding  to  a 
massage. 

8-  Know  ledge  about  the  Sub  'act  Dnmal  o 

Communication  between  people  or  processors  can 
be  viewed  as  the  incremental  sharing  and  building 
of  their  respective  knowledge  bases.  Recent  work 
in  computational  linguistics  has  demonstrated  tae 
importance  to  each  participant  in  a  dialogue  of 
having  a  very  rich  knowledge  base  that  is  much  sore 
than  a  static  description  of  the  facts  that  are 
currently  true  in  the  domain  of  discourse.  The 
knowledge  base  must  encode  a  dynamic  environment 
consisting  of  various  actors,  objects, 
relationships  and  events,  orcered  or  partially 
ordered)  witn  respect  to  time.  It  cust  cescnce 


not  only  the  environment  as  it  truly  is,  but  also 

tha  knowledge  and  beliefs  about  the  environment 
that  are  bald  by  eaeb  participant  in  the  dialogue. 

C.  Knowledge  about  Contest 

Natural  language  is  used  for  communication  in 
a  dynamically  changing  context,  in  utterance  in  a 
dialogue  cannot  typically  be  interpreted  is 
isolation;  it  nust  be  analyzed  within  the  context 
in  which  it  was  produced.  An  interpretation  is 
influenced  by  the  current  state  of  the  envlroonent, 
by  a  bistory  of  the  previous  states,  by  the  overall 
structure  and  content  of  the  dialogue,  by  knowing 
who  produced  the  utterance  and  for  whoa  it  was 
intended,  and  by  the  preceding  utterances  in  the 
dialogue. 

this  conplex  collection  of  required  state 
information  renders  genuine  comprehension  of 
natural  dialogue  beyond  the  current  state  of  the 
art.  It  Is,  however,  an  extremely  efficient  means 
of  caamunieating  parsimoniously  over  a  noisy  medium 
when  there  is  sufficient  processing  power  available 
on  both  sides  of  tha  communications  link.  By  each 
having  all  this  knowledgs  about  the  subject  domain 
and  the  current  context  of  the  interaction, 
processors  can  communicate  using  many  fewer  bits. 
Furthermore,  because  the  processors  are  continually 
engaged  In  augmenting  and  checking  a  shared 
knowledge  base,  errorful  transmissions  are  much 
more  likely  to  be  noticed,  and  a  subdialogue 
requesting  confirmation  of  the  suspicious  message 
can  be  Initiated. 

Employing  this  state  information,  then,  may 
provide  significant  additional  efficiency  sad 
reliability  in  the  oonauMcationa  process.  On  the 
other  hand,  maintaining  and  exploiting  this  state 
information  imposes  a  very  significant  additional 
computation  load.  Researchers  exploring  the 
distribution  of  artificial  intelligence 
capabilities  will  need  to  evaluate  the  tradeoffs 
between  tha  increased  processing  required  and  tha 
enriched  communication  provided  by  this  approach. 


D.  Knowledge  about  f  nmmunl  gaf.' ng 

To  enable  the  rich  interactions  we  have  been 
daaoribing,  a  third  kind  of  knowledge  is  needed. 
This  la  kaowledgs  about  the  "rules  of  the  game"  of 
communicating.  In  typical  systems  that  perform 
interprocess  complication  this  consists  of  no  more 
than  the  encoding  of  routines  that  can  either 
create  or  interpret  instances  of  particular  classes 
of  messages.  By  analogy  with  natural  language 
isidera tainting  systems,  comma! eating  processes  may 
also  have  to  know  enough  about  the  activity  of 
communication  Itself  to  determine  when  the  focus  of 
tha  communication  is  shifting,  what  the  current 
goals  of  the  sender  are  that  lie  behind  his  current 
transmissions ,  and  when  'and  why)  the  current 
sequence  of  transmissions  is  coherent  as  a  whole. 
The  cues  for  these  kinds  of  information  are  often 
encoded  in  subtle  ways  in  natural  language 
dialogues.  Human  participants  seem  able  to  perform 
the  deductions  required  to  pick  up  these  cues 
easily.  Language  understanding  systems  have  been 
rather  poor  at  this  to  date.  By  encoding  the  cues 
more  explicitly  for  interprocess  communication,  the 
requirement  for  the  Individual  processors  to 
perform  complex  deductions  can  be  greatly  reduced. 
This  aspect  of  communication,  then,  appears  to  be 
one  that  can  be  Incorporated  rather  easily  Into  DAI 
systems. 

e.  Intarantlng  a  MsaaMB 

In  analyzing  and  interpreting  messages 
expressed  la  natural  language,  a  variety  of  kinds 
of  Information  must  be  brought  to  bear.  Firstly, 
there  is  knowledge  about  the  syntax,  how  individual 
words  are  combined  into  phrases  and  how  phrases  are 
combined  into  sentences.  Secondly,  there  is 
lexical  knowledge,  about  the  meanings  of  individual 
words  and  the  roles  they  can  take  on  within  Largtr 
phrases.  Thirdly,  there  is  knowledge  about  the 
mapping  between  the  phrases  of  the  input  and 
descriptions  of  objects  in  the  internal 
repreeentatlon  of  the  subject  domain.  (Strictly 
speaking,  this  is  what  philosophers  mean  when  they 
refer  to  semantics,  although  the  term  is  elnost 
always  used  in  a  much  wider  sense  in  the  literature 
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of  artificial  intelligence.)  Finally,  knowledge 

about  the  currant  qtate  ia  needed  to  rafine  tba 
descriptions  of  objaot*  into  designations  of 
particular  antitiaa  in  tn*  "real  world." 

While  those  typaa  of  knowledge  are  listed  in 
order  of  increasing  complexity  and  difficulty  of 
uaa.  they  are  not  employed  in  a  atrictly  linear 
order.  Since  the  analysis  at  amah  level  affeota 
the  oonfldanco  in  the  conclusions  dram  at  other 
lavela,  the  overall  interpretation  is  usually  built 
up  incrementally  with  many  partial  contributions 
from  all  levels. 

The  designer  of  a  system  that  involves 
Interprocess  communication  typically  builds  in  to 
the  communicating  processes  knowledge  of  the  first 
two  sorts  described  above.  He  worries  about  the 
structure  of  each  sassage  and  about  the  values  and 
meanings  of  the  fields  within  those  messages.  The 
other  types  of  knowledge  are  "hard-wired"  into  his 
programs,  and  are  typically  extremely  simple.  Each 
message  typically  has  an  unambiguous  meaning 
independent  of  its  ordering  within  the  overall 
dialogue. 

This  design  of  a  communication  protocol  is 
appropriate  for  situations  where  the  processing 
cost  of  sending  or  receiving  each  message  must  be 
kept  low.  The  use  of  the  semantically  oriented 
kinds  of  knowledge  makes  the  processing  for  massage 
transmission  very  much  more  expensive.  However,  if 
processing  at  the  source  and  destination  is 
relatively  cheap  (and  we  expect  this  to  be  the  case 
for  DAI  system),  the  overall  oost  of  the  system 
might  be  minimized  by  trading  off  higher  processing 
requirements  for  lower  bmadwidths  and  higher  noise 
levels, 

?.  faaBQ«Uag  la  x  ttaaaiga 

He  have  Just  sketched  the  (rather  complex) 
process  by  which  a  message  might  be  Interpreted  by 
a  processor  in  a  DAI  system.  Once  the  message  is 
interpreted,  it  must  then  be  reponded  to.  As  might 
be  expected,  this  is  also  a  rather  coaplex  process 
for  a  natural  language  understanding  system,  and 
will  probably  be  complex  for  a  DAI  system  as  well. 


The  complexity  of  the  response  derives  mostly 

from  the  need  to  update  and  maintain  the  complex 
laowledge  about  state  information  described  in 
Section  C  shove.  The  response  must  include: 

*  a  cheek  for  the  validity  of  the  current 

assumptions  about  the  state  of  the  other 
communicating  process; 

*  a  check  for  cues  that  the  sender  of  the 
message  is  shifting  the  focus  of  the 
communication; 

*  a  determination  of  the  overt  actions  to  be 
taken  in  response  to  the  message; 

*  a  determination  of  whether  (and  under  what 
conditions  and  when)  a  return  message  is 
requested  or  required; 

*  the  generation,  if  needed,  of  a  return 
message  (which  will  Involve  encoding 
sufficient  information  for  the  other 
process  to  perform  the  same  set  of  tasks). 

As  was  the  case  with  the  task  of  interpreting 
messages,  responding  to  messages  will  be  easier  for 
coaputer-to-oaaputer  interactions  than  for  natural 
language  communication.  By  requiring  explicit 
indications  of  shifts  of  focus  and,  perhaps,  an 
explicit  indication  of  nonstandard  or  unexpected 
assumptions,  the  response  to  messages  can  probably 
be  performed  with  tolerable  efficiency. 

o.  CnaalualBoa 

He-fcava  speculated  on  the  possible  relevanoe 

,  rTe  llj'f  t, 

of  the  -Ur  approach  topiatural  language 
understanding  to  the^problea  of  communication 
between  processors  in  a  distributed  artificial 
intelligence  system.  While  the  processing  required 
to  interpret  ■  natural  language  is  almost  certainly 
far  more  than  a  DAI  system  needs,  a  simplified  form 
of  this  processing  that  performs  all  or  most  of  the 
tasks  performed  in  understanding  natural  language 
zsy  will  be  required.  This  suggests  that  designers 
of  DAI  systems  might  best  design  their 
oamunlcatlons  protocols  by  narrowing  down  the 
capabilities  required  for  natural  language 
communication  rather  than  by  building  up  froa  the 
traditional,  individual-msssage-orlsnted  approach. 
The  key  pointT~have  tried  to  asks  is  that,  to 
develop  a  robust  distributed  system  that  is  able  to 
communicate  over  a  noisy,  relatively  narrow- 
bandwidth  channel,  one  must  consider  not  Just  the 


'0 


format  and  content  of  the  individual  messages  that 

ara  used  to  communicate,  but  also  the  role  that 
sequences  of  messages  play  in  the  overall  process 
of  communication.  r 
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Wa  describe  ijthe  tnrttaMiatton  and  operation  of  a 
distributed  sensing  system  baaad  on  tha  contract  not 
framework.  in  a  departure  froai  earlier  eya  terns,  taak 
dlatrlbutkm  la  viewed  aa  a  local  mutual  adactfon  prooess,  a 
dlacuaaion  earned  on  between  a  node  with  a  taak  to  be 
executed  and  a  group  of  nodea  that  may  be  able  to  axeeute 
the  taak.  This  leada  to  the  uaa  of  a  control  formalism  baaed 
on  a  contract  metaphor,  in  whioh  taak 
correaponda  to  contract  negotiation. 


Olatributed  Problem  Solving!  Overview 


Oiatrlbuted  problem  salving  is  earned  out  in  a  processor 
architecture  in  which  the  individual  nodes  include  meawry  aa 
well  as  processing  capability,  in  such  a  problem  solver, 
control  is  decentralized,  the  nodea  are  loosely  coupled  <l.e., 
they  spend  a  far  greater  percentage  of  time  M  computation 
than  In  communication  with  other  nodea),  they  communicate 
via  messages,  and  they  cooperate  m  the  solution  of  a  single 
overall  problem. 


A  framework  for  distributed  problem  salving  has  been 
developed  that  specifies  mechanisms  for  communications, 
control,  and  knowledge  organization.  Tha  fraawwork  la  based 
on  the  human  model  of  experts  that  cooperate  to  solve 
problems  by  transfer  of  massages  and  use  a  contract 
negotiation  process  to  distribute  tasks  to  be  executed 
concurrently.  In  the  distributed  problem- solving  context,  the 
human  experts  correspond  to  processor  nodea.  Each  node 
contains  a  number  of  task-specific  knowledge- sources  (KSa). 
in  this  paper  we  writ  focus  on  the  communications  and  control 
aspects  of  the  framework. 


A  key  problem  that  must  ba  resolved  M  a  distributed 
problem  solver  is  now  nodea  with  tasks  to  ba  executed  find 
other  nodes  capable  of  executing  those  tasks.  We  wW  cad 
this  the  connection  problem.  In  centralized  problem  solvers  It  la 
called  the  invocation  pri Mm-,  that  Is.  which  K3  to  Invoke  at 
any  given  time  for  the  execution  of  a  task.  Because  A! 
applications  do  not  generally  have  well-defined  algorithms  for 
their  solution.  At  problem  solvers  need  considerable  heuristic 
knowledge  to  guide  them,  making  the  connection  problem 
crucial. 


1  This  work  has  been  supported  In  part  by  tha 
Advanced  Research  Protects  Agency  under  contract  MOA 
903- 77-C -0322,  and  the  National  Science  Foundation  under 
contract  MCS  77-02712.  It  has  bean  carried  out  on  the 
SUMEX-AIM  Computer  Facility,  supported  by  tha  National 
Institutes  of  Health  under  grant  AR-00786.  Raid  Smith  Is  now 
at  the  Oefencs  Rs search  Establishment  Atlantic,  Dartmouth, 
Nova  Scotia.  Canada.  Randall  Davis  is  now  at  the  MIT 
Artificial  Intelligence  Laboratory,  Cambridge,  Mass. 


The  Contreet  Net  Fr, 
Control 


The  contract  net  framework  includes  a  pnttem-iotmng 
protocol  [Smith,  1077]  [Smith,  187*0],  an  extrapolation  to 
the  problem-solving  level  of  the  standard  network 
communications  protocol.  This  protocol  encodes  task- 
independent  Information  that  specifies  the  possible  actions 
and  Interactions  of  the  nodes  of  the  problem  salver  and  also 
includes  slots  for  the  task -dependent  Information  necessary 
for  the  decisions  that  guide  the  control.  The  task-dependent 
Information  in  tha  framework  la  encoded  M  a 
intemode-Janguage,  understandable  to  a*  nodes. 


The  problem-solving  protocol  uaes  a 
bid  -  award  seguence  of  contract  negotiation  to  solve  the 
connection  problem.  It  views  task  distribution  as  an 
Interactive  process,  one  that  that  entails  a  discussion 
between  a  node  with  a  task  to  be  executed  and  nodes  that 
may  ba  able  to  execute  the  task.  The  protocol  defines  a  sat 
of  messages  that  has  so  far  proven  adequate  for  both 
control  and  data  distribution. 


A  contract  net  is  a  collection  of  Interconnected 
processor  nodes  whose  Interactions  are  governed  by  a 
problem- solving  protocol  based  on  the  contract  metaphor 
Each  node  m  the  net  operates  asynchronously  and  with 
relative  autonomy.  Tha  execution  of  an  individual  task  is 
nandled  as  a  contract.  A  node  that  ganaratss  a  taak 
advertises  existence  of  that  task  to  tha  other  nodes  m  the 
net  with  a  fee*  announcement,  then  acts  aa  the  manager  of 
that  task  for  Its  duration.  In  the  absence  of  any  Information 
about  the  specific  capabilities  of  tha  other  nodes  m  the  net. 
the  manager  is  forced  to  issue  a  general  broadcast  to  ak  nodes, 
if,  however,  the  manager  possesses  some  knowledge  about 
which  of  the  other  nodes  m  the  net  ere  likely  csnatdttas. 
then  it  esn  issue  a  limited  broadcast  to  Just  those  candldatas. 
Finally,  If  tha  manager  knows  exactly  winch  of  the  other 
nodes  Is  approprtete.  then  it  can  issue  a  fmnt-to-pmnt 
announcement.2  As  work  on  the  problem  progresses,  many 
such  task  announcements  wW  be  made  by  various  managers. 


Nodes  In  the  net  have  been  listening  to  tbs  task 
announcements,  and  have  been  evaluating  their  own  level  of 
interest  M  each  task  with  respect  to  their  specialized 
hardware  end  software  resources.  When  e  task  Is  found  to 
bo  of  sufficient  interest,  s  nods  submits  a  Md.  A  bid 
Indicates  the  capabilities  of  tha  bidder  that  srs  relevant  to 


2  Restricting  the  set  of  sedrsssess  of  an 
announcement  (which  we  can  focused  addratatnq)  is 
typically  s  heuristic  process,  sines  the  informstion  upon 
which  it  Is  based  may  not  be  exact  (e.g..  It  may  be  inferred 
from  prior  responses  to  announctmonts). 
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execution  of  th*  announced  task.  A  manager  may  raeaiva 
savaral  sued  bids  in  raaponaa  to  a  single  taak  announcamant; 
basad  on  tna  information  In  tba  bid*,  it  selects  ona  (or 
savaral)  nodefa)  for  ax acutlon  of  th*  task.  Tha  satactlon  Is 
communicated  to  tha  successful  biddaris)  through  an  award 
massage.  Thasa  select*#  nodes  assume  raaponalbitity  for 
execution  of  tha  task,  and  each  is  caked  a  contractor  for 
that  task. 

A  contract  is  thus  an  explicit  agreement  between  a 
nod*  that  generates  a  task  (the  manager)  and  a  node  that 
executes  the  task  (tha  contractor).  Not*  that  attack* ding  a 
contract  is  a  process  of  mutual  selection.  AvakoM* 
contractors  evaluate  task  announcamant*  made  by  several 
managers  until  they  find  one  of  Intsrest;  tha  managers  than 
evaluate  the  CM*  received  front  potential  contractors  and 
salact  on*  they  detarmm*  to  be  most  approprtats.  Both 
parties  to  tha  sgraamant  nave  evaluated  the  information 
supplied  by  the  other  and  a  mutual  selection  been  made. 

The  contract  negotiation  process  Is  axpeditad  by  thraa 
forms  of  task-dopandant  information  contained  in  a  task 
announcement.  An  eligibility  specification  lists  tha  criteria 
that  a  nods  must  mast  to  ba  eligible  to  submit  s  bid.  This 
specification  raducas  massage  traffic  by  pruning  nodes 
whose  bids  would  ba  clearly  unacceptable.  A  taak 
abatraetton  is  a  brief  description  of  tha  task  to  ba 
executed,  and  akowa  a  potential  contractor  to  evaluate  Its 
level  of  interest  In  executing  this  task  relative  to  others  that 
arm  avakabi*.  An  abstraction  is  usad  rather  than  a  complete 
description  In  ardor  to  reduce  the  length  of  tha  massage 
(and  hanca  massage  traffic).1  Finally,  s  bid  speefftcatfon 
details  tha  expected  form  of  a  bid  for  that  taak.  It  enables  a 
potential  contractor  to  transmit  a  bid  that  contains  only  a 
brief  specification  of  Its  cspabkitlas  that  ara  relevant  to  tha 
task  (thi*  specification  is  called  a  no da  abstraction),  rather 
than  a  complete  description.  This  both  simplifies  the  task  of 
the  managor  in  evaluating  bids,  and  further  raducas  massage 

traffic.3 4 

Contracts  ara  queued  locally  by  the  node  that 
generates  them  untk  they  can  b*  awarded.  If  no  bids  ara 
received  far  a  contract  by  tha  time  an  expiration  time 
(included  in  the  task  announcamant)  has  passed,  than  th* 
contract  is  ra-announcad.  This  process  is  repeated  untk  th* 
contract  can  b*  awarded.1 

Tha  award  massage  contains  a  task  specification, 
which  include*  th*  coaipiata  specification  of  th*  taak  to  ba 
executed.  After  th*  task  has  bean  completed,  th*  contractor 
sands  s  report  to  its  manager.  This  massage  Includes  • 
result  daacrlptlon.  winch  communicates  tha  results  that 
have  bean  achieved  during  execution  of  th*  taak. 

Th*  manager  may  terminate  th*  execution  of  contract* 


3  On*  component  of  tho  abstraction  is  th*  task  tjpt,  or 
generic  classification  of  the  task. 

4  Th*  Information  that  makes  up  th*  eligibility 
specification,  task  abstraction,  and  bid  specification  for  any 
given  example  must  b*  supplied  by  th#  applications 
programmer.  In  Section  3  wo  wkl  sea  examples  of  this  type 
of  information. 

1  Th*  is  a  simplified  version  of  the  actual  procsss  (and 
does  not  work  in  th*  case  of  a  task  that  cannot  ba  executed 
due,  for  example,  to  lack  of  sufficient  data);  sea  [Smith, 
1978c]  for  tha  cemplata  description. 


ss  necessary  (with  a  termination  massage),  and  further 
(sub)contracts  may  b*  let  in  tum  as  required  by  the  size  of  s 
contract  or  by  a  requirement  for  special  expertise  or  data 
that  tha  contractor  does  not  have. 

It  is  important  to  not*  that  individual  nodes  ara  not 
designated  a  priori  as  managers  or  contractors.  These  are 
only  rotes,  and  tny  nod*  can  taka  on  either  role.  During  tha 
course  of  problem  solving,  a  particular  nod*  normally  takas  on 
both  roles  (perhaps  even  simultaneously  for  different 
contracts).  This  IsaOs  to  mors  efficient  utilization  of  nodes, 
as  eoaiparad,  for  sxampl*.  to  sens  mas  that  do  not  snow 
nodes  that  have  contracted  out  suo tasks  to  taka  on  other 
tasks  whk*  they  ara  waiting  for  results.  Individual  nodes, 
then,  are  not  statically  tied  to  th*  control  hierarchy. 

Note  also  that  tha  idea  of  transfer  of  expertise 
between  nodes  (via  tranafar  of  proesdurss  or  data)  can  ba 
readily  handled  by  tha  protocol.  It  can  b*  handed  as  a 
standard  contract  between  a  nod*  that  announces  (In 
affect)  /  ntad  tht  cede  for  « prxtdurt-<lttcnption> ,  and  a  node 
that  bids  on  tho  taak  by  indicating  that  It  ha*  tha  required 
Information. 

To  review,  the  normal  method  of  negotiating  a  contract 
is  for  a  nods  (called  th*  manager  for  a  task)  to  issue  a  task 
announcamant.  Many  such  announcement*  ara  made  over  tha 
course  of  time.  Other  nodes  ara  listening  and  submit  bid*  on 
those  announcamant s  for  which  they  ara  suited.  Tha 
managers  evaluate  tha  bids  and  award  contract*  to  tha  most 
suitable  nodes  (which  are  than  called  contractor*  for  tha 
awarded  tasks). 

Th*  normal  contract  negotiation  process  can  ba 
simplified  fn  soma  instance*,  with  a  resulting  enhancement  in 
tha  efficiency  of  tha  protocol.  If  a  manager  knows  exactly 
which  nod*  is  appropriate  for  execution  of  a  task,  a  directed 
contract  can  ba  awarded.  This  differs  from  tha 
announcad  contract  In  that  no  announcamant  is  made,  and  no 
bids  are  submitted,  instead,  sn  award  is  mad*  directly.  In 
such  eaaas.  nods*  awarded  contracts  must  acknowledge 
racalpt  and  have  the  option  of  refusal. 

Th*  protocol  has  also  boon  designed  to  aHow  ■  reversal 
of  tha  normal  negotiation  process.  Whan  tha  processing  load 
on  th*  net  is  high,  most  task  announcements  wik  not  ba 
answered  with  bids  because  all  nodes  wm  be  already  busy. 
Hanc*  th*  protocol  includes  a  node  availability 
announcement  massage.  Such  a  massage  can  b*  issued  by 
an  idl*  node,  it  is  sn  invitation  for  manager*  to  sand  task 
announcements  or  directed  contracts  to  that  node. 

Finaky,  for  taaka  that  amount  to  siaipfe  requests  for 
Information,  a  contract  may  not  ba  appropriate,  in  such 
cases,  s  request  •  raaponaa  sequence  can  b*  used  without 
furthor  embellishment.  Such  massages  (that  sid  in  tha 
distribution  of  data  as  opoosad  to  control)  ara  implemented 
as  recue  at  and  Information  massages.  Th*  request  massage 
la  usad  to  encode  straightforward  requests  for  information 
for  whlcn  contracting  is  unnecessary.  Tha  information 
maaaag*  >*  usad  both  as  a  ratoons*  to  a  request  message 
and  aa  a  general  data  tranafar  massage. 
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3  Distributed  Ssnatng 

in  this  section,  we  demonstrste  the  use  of  the  contrsct 
net  framework  in  the  solution  of  s  problem  in  ares 
surveillance,  such  as  is  encountered  in  sMp  or  air  traffic 
control.  The  example  will  help  to  demonstrate  the  contract 
net  approach  to  communications  and  control. 

We  consider  the  operation  of  a  network  of  nodes,  each 
having  either  sensing  or  processing  capabilities  and  an 
spread  throughout  a  relatively  large  geographic  area.  Such  a 
network  Is  called  a  Distributed  Sensing  System  (OSS). 

The  primary  aim  of  the  system  la  rapid,  reliable, 
accurate,  and  low-cost  analysis  of  the  traffic  In  a  designated 
area.  This  analysis  Involves  detection,  claaaifleatlan,  and 
tracking  of  vehicles-,  that  Is,  the  solution  to  the  problem  la  s 
dynamic  map  of  traffic  in  the  area.  Construction  and 
maintenance  of  such  a  map  requires  the  interpretation  and 
integration  of  a  large  quantity  of  sensory  Information 
received  by  the  collection  of  sensor  elements. 

There  are  many  trade-offs  involved  in  the  design  of  a 
□SS  architecture.  We  present  only  one  possible  approach 
that  considers  a  limited  number  of  these  tradeoffs.9  The 
primary  intent  of  this  example  is  to  demonstrate  the  contract 
net  approach,  and  we  therefore  focus  on  the  initialization  and 
communications  aspects  of  the  D3S.  For  a  discussion  of  other 
aspects  of  this  problem  see  [HP,  1878], 


3. 1  Hardware 

All  communication  in  the  DSS  is  assumed  to  take  piece 
over  a  broadcast  channel  (using  for  example,  packet  radio 
techniques  [Kahn,  1978]).  The  nodes  are  assumed  to  be  In 
fixed  positions  known  to  themselves  but  not  known  s  priori  to 
other  nodes  in  the  nat.  Each  node  has  one  of  two 
capabilities:  sensing  or  processing.  The  sensing  capability 
includes  low-level  signal  analysis  and  feature  extraction.7 
We  assume  that  a  variety  of  sensor  types  axiat  in  the  OSS, 
that  tha  sensors  are  widely  spaced,  and  that  there  is  some 
overlap  in  sensor  area  coverage.  Nodes  with  processing 
capability  supply  the  computational  power  necessary  to 
effect  the  high-level  analysis  and  control  in  the  net.  They 
are  not  necessarily  near  the  sensors  whose  data  they 
process. 

A  OSS  may  have  several  functions,  ranging  from 
analysis  of  vehicle  data  in  the  overall  area  of  coverage  to 
control  over  the  courses  and  speeds  of  those  vehicles.  We 
consider  here  tha  analysis  function,  in  this  case  the  overall 
area  map  must  be  integrated  at  one  node  in  the  system  (It 
could  also  be  integrated  by  an  agent  outside  the  system,  Uka 
a  monitoring  aircraft).  We  therefore  distinguish  one 
processor  node  as  the  monitor  node,  its  function  is  to  begin 
the  initialization  of  the  OSS.  and  integrate  the  overall  area 
map  for  communication  to  an  agent  outside  the  OSS.  We  wM 
see  that  it  does  not  correspond  to  a  central  controller. 


Figure  3.1  is  a  scnematic  representation  of  a  OSS. 
Processing  nodes  are  shown  in  black,  and  sensing  nodes  in 
white.  The  monitor  node  is  shown  in  white  with  an  ”M*  in  tha 
middle. 
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Figur*  3.1.  A  Distributed  Sensing  System. 


3.2  Data  And  Task  Hierarchy 

The  OSS  must  Integrate  a  large  quantity  of  signal  data, 
reducing  it  and  transforming  it  into  a  symbolic  form  maanmgful 
to  and  useful  to  a  Human  decision  maker.  We  view  this 
process  as  occurring  In  several  stages,  which  together  form 
a  data  hierarchy  (Figure  3.2).  The  hierarchy  offers  an 
overview  of  OSS  functions  and  suggests  a  task  partitioning 
suitable  for  a  contract  net  approach.  A  particular  node  in  the 
DSS  handlea  data  st  only  one  level  of  the  data  hierarchy  st 
any  given  moment,  and  communicates  with  nodes  at  other 
levels  of  tha  hierarchy. 


overall  ares  map 
area  map 


*  Further  discussion  of  the  background  issues  inherent 
in  OSS  design  is  presented  in  [Smith,  1978a], 

7  in  a  real  OSS.  it  is  likely  that  senaors  and  low-level 
signal  analysis  devices  would  not  be  considered  as  statically 
connected  parts,  as  it  is  often  the  esse  that  many  different 
types  of  analysis  are  applied  to  the  output  of  a  single 
sensor,  or  to  that  of  groups  of  sensors  tsxen  together. 


vehicle 
signal  group 
signal 


Figure  3.2.  Data  Hierarchy. 


For  purposes  of  this  example,  the  only  form  of  signal 
processing  we  consider  is  narrow  Band  spectral  analysis.  The 
signal  has  me  following  features:  frequency,  time  of 
detection,  strength,  characteristics  (e.g.,  Increasing  signal 
strength),  name  and  position  of  the  detecting  node,  and 
name,  type,  and  orientation  of  the  detaetmg  sensor.' 

Signals  are  formed  Into  jifitoi  groups  at  the  second 
level  of  the  data  hierarchy.  A  signal  group  is  a  collection  of 
related  signals.9  For  this  example,  the  signal  groups  have  the 
following  features:  the  fundaatentai  frequency  of  the  group, 
the  time  of  group  formation,  and  the  features  of  the  signals  in 
the  group  (as  above). 

The  next  level  of  the  hierarchy  Is  the  description  of  the 
vrkiclt.  It  has  one  or  more  signal  groupa  associated  with  it 
and  is  further  specified  by  position,  speed,  course,  and 
type.'0  Position  can  be  established  by  triangulation,  using 
matching  groups  detected  by  several  sensors  with  different 
positions  and  orientations.  Speed  and  course  must  be 
established  over  time  by  tracking. 

The  aria  map  forms  the  next  level  of  the  data 
hierarchy.  This  map  incorporates  information  about  the 
vehicle  traffic  in  ar<  area,  it  is  an  integration  of  the  vehicle 
level  data.  There  will  be  several  such  amps  for  the  OSS, 
corresponding  to  areas  in  the  span  of  coverage  of  the  net 

The  final  level  is  the  complete  or  otoraU  ana  map.  In 
this  example,  the  map  Is  integrated  from  the  individual  area 
maps  by  the  monitor  node. 

As  indicated  above,  the  hierarchy  of  tasks  follows 
directly  from  the  data  hierarchy.  The  monitor  node  manages 
several  arta  contractors.  These  contractors  are  responsible 
for  the  formation  of  traffic  maps  In  their  Immediate  areas. 
Each  area  contractor.  In  turn,  manages  several  group 
contractors  that  provide  it  with  signal  groupa  for  its  area 
(Figure  3.3).  Each  group  contractor  integrates  raw  signal 
data  from  signal  contractors  that  have  sensing  capabilities. 

The  area  contractors  also  manage  several  srAizfr 


a  The  frequency  spectrum  of  noise  radiated  by  a 
vehicle  typically  contains  narrow  band  signal  components 
that  are  caused  by  rotating  machinery  associated  with  the 
vehicle  (e.g..  engines  or  generators).  The  frequencies  of 
such  signals  are  correlated  with  the  type  of  rotating  machine 
and  its  speed  of  rotation.  They  are  Indicators  of  the 
classification  of  the  vehicle.  Narrowband  signals  also 
undergo  shifts  m  frequency,  due  to  doppler  effect,  or 
instability  and  change  m  the  speed  of  rotation  of  the 
associated  machine.  Alterations  in  signal  strength  also  occur 
as  a  result  of  propagation  conditions  and  venations  In  the 
distance  between  the  vehicle  and  the  sensor. 

9  A  signal  group  that  is  often  used  to  Integrate  narrow 
band  signal  data,  for  example,  is  the  harmonic  set,  a  group  of 
signals  that  are  harmonically  related  (l.e..  the  frequency  of 
each  signal  in  the  group  la  an  integral  multiple  of  the  lowest, 
or  fundamental  frequency).  A  single  rotating  machine  often 
gives  rise  to  several  narrow  band  signals  that  form  a 
harmonic  set. 

10  For  simplicity,  we  have  Ignored  a  level  In  the 
hierarchy  that  can  be  called  component  tourers  of  signal,  aa  In 
[Mil,  taraj.  At  this  level,  a  OSS  would  normally  try  to 
attribute  signals  and  signal  groups  to  particular  pieces  of 
machinery  associated  with  a  vehicle. 


contractors  that  are  responsible  for  integration  of  information 
associated  with  individual  vehicles.  Each  of  these 
contractors  manages:  a  classification  contractor  that 
determines  vehicle  type;  a  localization  contractor,  that 
determines  vehicle  position;  and  a  tracking  contractor,  that 
tracks  the  vehicle  aa  it  passes  through  the  area." 

area 


vehicle 

group 


signal  classification  localization  tracking 

Figure  3.3.  Area  Task  Partitioning. 

Note  that  this  particular  partitioning  of  tasks  Is  only  one 
of  many  posaibilltlas  that  might  be  specified  by  the  system 
designer. 


3.3  Contract  Net  Implementation 

This  section  reviews  in  qualitative  terms  how  the  OSS 
problem  can  be  attacked  using  the  contract  net  approach 
and  illustrates  several  of  the  ideas  central  to  its  operation. 
Appendix  A  gives  specific  examples  of  the  message  traffic 
that  is  described  here. 


3.3.1  fnftiaiizatfon 

The  monitor  node  is  responsible  for  initialization  of  the 
OSS  and  for  formation  of  the  overall  map.  it  must  first  select 
nodes  to  be  area  contractors  and  partition  the  system's  span 
of  coverage  into  areas,  based  on  the  positions  of  those 
selected  nodes.  For  purposes  of  illustration  we  assume  that 
the  monitor  node  knows  the  names  of  nodes  that  are 
potential  area  contractors,  but  it  must  establish  their 
positions  In  order  to  partition  the  overall  span  of  coverage. 
Hence,  it  begins  by  announcing  contracts  for  formation  of 
area  maps  of  the  traffic.  Because  the  monitor  node  knows 
the  names  of  potential  area  contractors,  it  can  avoid  a 
general  broadcast  and  can  instead  use  a  focused  addressing 
scheme.  The  announcement  contains  the  three  components 
described  In  Section  2:  a  task  abstraction,  an  eligibility 
specification,  and  a  bid  specification.  The  task  abstraction  is 
simply  the  task  type,  and  the  eligibility  specification  is  blank 
(because  the  monitor  node  knows  which  other  nodes  are 
potential  contractors  and  addresses  them  directly).  The  bid 
specification  is  of  primary  int  ast  for  this  task.  It  informs  a 
prospective  area  contractor  to  respond  with  its  position. 
Remember  that  the  purpose  of  a  bid  specification  is  to 
enable  a  manager  to  select,  from  ail  of  the  bidders,  the  most 


”  In  a  real  solution  to  the  DSS  problem,  it  is  possible 
that  not  aH  of  these  tasks  would  be  large  enough  to  justify 
the  overhead  of  contracting;  that  is.  some  of  them  might  be 
done  in  a  single  node.  Note  also  that  some  of  the  tasks  in  the 
hierarchy  are  continuing  tasks  (a.g..  tha  area  task),  while 
others  are  ont-umt  tasks  (e.g.,  the  localization  task). 
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appropriate  nodes  to  execute  e  contract  Node  position  is 
tne  information  required  by  the  monitor  node  to  make  that 
selection.  Given  that  information,  the  monitor  node  can 
partition  the  overall  span  of  coverage  into  approximately 
equal-sized  areas,  and  select  a  subset  of  the  bidders  to  be 
area  contractors.  Having  decided  upon  a  partitioning,  the 
monitor  node  broadcasts  an  Information  message  to  the  other 
nodes  in  the  system.  This  message  defines  the  names  and 
specifications  (in  terms  of  latitude  and  longitude  ranges)  of 
the  individual  areas.  Each  selected  area  contractor  la  then 
informed  of  its  area  of  responsibility  In  an  award  message.12 

The  area  contractors'  purpose  is  to  integrate  vehicle 
data  into  area  maps.  They  must  first  establish  the  existence 
of  vehicles  on  the  basis  of  signal  group  data.  Therefore,  each 
area  contractor  solicits  other  nodes  to  provide  signal  group 
data,  in  the  absence  of  any  information  about  which  nodes 
are  suitable,  each  araa  contractor  announces  the  task  using 
a  general  broadcast.  The  task  abstraction  In  these 
announcements  is  the  type  of  task.  The  eligibility 
specification  is  the  area  for  which  the  individual  area 
contractor  is  responsible:  that  is,  a  node  is  only  ailgibla  to 
bid  on  this  task  If  it  is  in  tha  same  area  as  the  announcing 
area  contractor.  This  restriction  helps  to  prevent  a  esse  In 
which  a  signal  group  contractor  is  so  far  away  from  its 
manager  that  reliable  comaiunication  is  difficult  to  achieve. 
The  bid  specification  is  again  node  position.  Potential  group 
contractors  respond  with  their  respective  positions,  and, 
based  on  this  information,  the  area  contractors  award  signal 
group  contracts  to  nodes  in  their  areas  of  responsibility. 

The  signal  group  contractors'  task  Is  to  integrate  signal 
data  from  sensor  nodes  into  signal  groups.  Therefore,  they 
must  first  find  nodes  that  will  provide  raw  signal  data.  This  is 
dona  with  signal  task  announcements.  The  eligibility 
specification  in  these  announcements  Indicates  that  only 
those  nodes  located  in  the  same  area  as  the  announcer  and 
having  sansing  capabilities  should  bid  on  thta  task.  The  task 
abstraction  indicates  the  task  type  and  position  of  sn 
individual  signal  group  contractor.  This  information  assists 
potential  signal  contractors  in  determining  the  group 
contractors  to  which  they  should  respond.13 

The  potential  signal  contractors  listen  to  the  task 
announcements  from  the  various  group  contractors.  They 
respond  to  the  nearest  group  contractor  with  a  bid  that 
supplies  their  position  and  a  description  of  their  sensors. 
The  group  contractors  use  this  information  to  select  a  set  of 
bidders  that  covers  their  immediate  vicinity  with  a  suitable 
variety  of  sensors,  and  then  award  signal  contracts  on  this 
basis.  The  awards  specify  the  sensors  that  each  signal 
contractor  is  to  use  to  provide  raw  data  to  Its  managing 
group  contractor.  Figure  3.4  depicts  the  exchange  between 
one  group  contractor  (the  black  nods),  and  several  potential 
signal  contractors  (the  white  nodes).  Successful  bidders  are 


12  The  full  announcament-bld-award  sequence  Is 
necessary  (rather  than  a  directed  contract)  because  the 
monitor  node  needs  to  know  the  positions  of  all  of  the 
potential  araa  contractors  in  order  to  partition  the  overall 
span  of  coverage  of  the  OSS  into  manageable  areas.  Note 
that  this  means  that  the  OSS  can  adjust  to  a  change  in  the 
number  or  position  of  potential  area  contractors. 

13  The  signal  task  is  to  detect  signals  and  report  them 
to  a  signal  group  contractor  in  a  particular  position.  Hence 
the  position  of  the  group  contractor  is  reasonable  as  part  of 
the  abstraction  for  the  task. 


connected  by  solid  lines  to  the  group  contractor  and 
unsuccessful  bidders  are  connected  by  dashed  lines. 


Figure  3.4.  Signal  Contract  Negotiation. 

There  are  some  potential  problems  of  asynchrony  in  the 
receipt  of  announcements  from  group  contractors.  A 
potential  contractor  for  signal  tasks  must  determine  the 
group  contractor  that  is  closest  to  it  by  listening  to  several 
signal  task  announcements.  Tha  potential  signal  contractors 
use  the  expiration  times  of  the  announcements  (l.e.,  tha 
times  after  which  no  further  bids  will  be  accepted)  as  a 
guide  to  the  length  of  time  during  which  they  can  listen  to 
announcements  bafore  submitting  a  bid.  In  the  best  case,  the 
expiration  times  are  long  enough  to  allow  announcements 
from  group  contractors  to  reach  all  local  potential  signal 
contractors  so  that  optimum  partitioning  can  be  achieved.  In 
the  worst  case,  however,  a  potential  signal  contractor  may 
submit  a  bid  to  a  group  contractor  that  is  not  the  closest  one 
to  It  (because  tha  task  announcement  of  the  closest  group 
contractor  is  not  received  until  after  a  bid  haa  already  been 
submitted  to  another  group  contractor).  The  result  is  a  sub- 
optimal  partitioning. 

The  signal  contract  is  a  good  example  of  the  contract 
negotiation  process,  illustrating  how  the  matching  of 
contractors  to  managers  is  an  interactive  process.  It 
Involves  a  mutual  dacision  based  on  local  processing  by  both 
the  group  contractors  and  the  potential  signal  contractors. 
The  potential  signal  contractors  base  their  decision  on  a 
distance  metric  and  respond  to  the  closest  manager.  Tha 
group  contractors  use  the  number  of  sensors  and  distribution 
of  sensor  types  observed  in  tha  bids  to  select  a  set  of  signal 
contractors  that  ensures  that  every  area  is  covered  by 
every  kind  of  sensor.  Thus  each  party  to  tha  contract 
evaluates  the  proposals  made  by  the  other,  using  a  different 
evaluation  function,  and  a  task  distribution  agreement  is 
completed  via  mutual  selection. 

Reviewing  the  status  of  the  OSS.  we  have  a  single 
monitor  node  which  manages  several  area  contractors.  Each 
area  contractor  manages  several  group  contractors,  and 
each  group  contractor  manages  several  signal  contractors. 
The  data  initially  flows  from  the  bottom  to  the  top  of  this 
hierarchy.  The  signal  contractors  supply  raw  signal  data; 
each  group  contractor  integrates  the  raw  data  from  several 
signal  contractors  to  form  a  signal  group,  and  these  groups 
are  passed  along  to  the  area  contractors,  which  eventually 
form  area  maps  by  integrating  information  based  on  the  data 
from  several  group  contractors.  All  the  area  maos  are  then 
passed  to  the  monitor  which  forms  the  final  traffic  map. 

As  we  have  noted,  in  this  example  one  area  contractor 
manages  several  group  contractors  and  each  group 
contractor  in  turn  manages  several  signal  contractors,  it  is 
possible,  however,  that  a  single  group  contractor  should 
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supply  information  to  several  araa  contractors,  and  a  single 
signal  contractor  should  supply  information  to  savaral  group 
contractors,  it  may  be  useful,  for  instance,  to  have  a 
particular  group  contractor  near  an  araa  boundary  report  to 
the  area  contractors  on  both  sides  of  the  boundary.  This  Is 
easily  accommodated  within  our  framewonc. 


3.3.2  Comments  On  The  OSS  Organization 

We  have  taken  a  top-down,  distributed  approach  to 
initializing  the  OSS.  An  alternative  approach  might  involve 
acquisition  of  the  positions  of  all  nodes  at  a  very  early  stage, 
followed  by  aree  definition,  award  of  area  contracts,  and  so 
on.  This  would  involve  a  more  global  approach  to  the  proMem 
of  initialization,  using  a  single  node  thet  initialized  the  net  by 
gathering  together  all  the  necessary  data.  We  have  not 
pursued  this  approach  for  several  reasons,  primarily  because 
it  would  tell  us  little  about  solving  problems  in  a  distributed 
manner.  An  underlying  theme  of  this  research  is  a  search  for 
ways  in  which  to  effect  distributed  problem  solving— rather 
than  ways  to  do  traditional  problem  solving  In  a  distributed 
architecture. 

Moreover,  there  are  two  practical  dlfflcultlas  with  the 
global  approach.  First,  it  concentrates  a  large  amount  of 
message  traffic  and  processing  at  a  single  node  (say  the 
monitor  node)  because  such  a  node  would  be  responsible  for 
accepting  position  messages  from  every  other  node  in  the 
net.  Second,  it  may  not  be  possible  for  any  single  node  to 
communicate  directly  with  all  other  nodes  in  a  widely 
separated  collection.  This  would  mean  that  either  indirect 
routing  of  messages  would  be  required  for  communication,  or 
that  each  of  the  nodes  would  require  powerful  transmitters. 
This  is  one  of  the  advantages  of  the  distributed  and 
dynamically-defined  organization  we  have  adopted:  only  pairs 
of  nodes  that  are  close  together  enough  to  comaiunicate 
directly  are  linked  together  with  contracts. 


3.3.3  Operation 

We  now  consider  the  activities  of  the  system  as  it 
commences  operation. 

When  a  signal  is  detected  or  when  a  change  occurs  in 
the  features  of  a  known  signal,  the  detecting  signal 
contractor  reports  this  fact  to  its  manager  (a  group 
contractor)  (Figure  3.3).  This  node.  In  turn,  attempts  to 
integrate  the  information  into  an  existing  signal  group  or  to 
form  a  new  signal  group. 
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Figure  3.9.  Signet  Contract  Reporting. 

A  group  contractor  reports  the  existence  of  a  new 
signal  group  to  its  manager  (an  area  contractor)  (Figure  3.6). 
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Figure  3.6.  Sroup  Contract  Reporting. 

Whenever  a  new  group  is  detected,  the  managing  area 
contractor  attempts  to  find  a  node  to  execute  a  vehicle 
contract  (Figure  3.7).  The  task  of  a  vehicle  contractor  is  to 
classify,  localize,  and  track  the  vehicle  associated  with  the 
signal  group.  Since  a  newly  detected  signal  group  may  be 
attributable  to  a  known  vehicle,  the  area  contractor  first 
requests  from  the  existing  collection  of  vehicle  contractors  a 
measure  of  confidence  in  the  fact  that  the  new  group  is 
attributable  to  one  of  the  known  vehicles.  Saaed  on  these 
responses,  the  area  contractor  either  starts  up  a  new 
vehicle  contractor  or  augments  the  existing  contract  of  the 
appropriate  existing  vehicle  contractor,  with  the  task  of 
making  certain  that  the  new  group  corresponds  to  a  known 
vehicle.  This  may  entail,  for  axample,  the  gathenng  of  new 
data  via  the  adjustment  of  sensors  or  contracts  to  new 
sensor  nodes. 
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Figure  3.7.  Vehicle  Contract  Initiation. 

The  form  of  the  signal  group  confirmation  request 
demonstrates  a  tradeoff  that  arises  in  many  distnbuted 
problem  solving  applications— a  tradeoff  between 
communication  and  local  processing.  The  area  contractor  has 
the  option  of  transmitting  to  the  existing  vehicle  contractors 
either  the  complete  signal  group  or  an  abstraction  of  it  (e.g., 
its  fundamental  frequency),  in  either  case,  the  response  to 
the  request  is  a  measure  of  the  individual  vehicle 
contractor's  confidence  that  the  group  corresponds  to  a 
vehicle  it  knows  about.  If  a  vehicle  contractor  returns  a  high 
confidence  measure  in  its  bid.  then  in  the  first  case 
(complete  signal  group  announced)  the  araa  contractor  has 
the  Informetton  It  wants.  In  the  second  case,  however,  the 
aree  contractor  must  still  transmit  the  complete  signal  group 
description  and  await  a  further  report. 

The  first  approach  has  the  disadvantage  of  using  up 
more  local  processing  time  in  each  of  the  vehicle  contractors 
than  does  the  second,  in  that  the  original  request  message  is 
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more  complex.  The  question  of  interest  Is,  Under  what 
condition!  shouid  a  compitu  signal  group  ditcnptum  tn  announctd 
insttad  of  an  abstraction*  The  answer  appears  to  depend  on 
the  quality  o t  the  abstraction.  If  the  abstraction  Is  good 
enough  that  the  vehicle  contractors  are  able  to  make 
definitive  statements  on  the  basis  of  its  use,  then  the 
second  approach  is  likely  better  than  the  first,  in  that  It 
minimizes  local  processing  time.  On  the  other  hand.  If  the 
abstraction  does  not  allow  definitive  statements  to  be  made, 
then  its  use  will  result  in  increased  massage  traffic  and  local 
processing  time,  since  the  area  contractor  will  not  be  certain 
as  to  the  best  course  of  action  as  a  result  of  uncertain 
responsea  from  the  vehicle  contractors. 

The  vehicle  contractor  then  makes  two  task 
announcements:  vehicle  classification  and  vehicle 

localization.  The  task  abstraction  of  the  classification  task 
announcement  Is  a  list  of  the  fundamental  frequencies  of  the 
signal  groups  currently  associated  with  the  vehicle.  This 
information  may  help  a  potential  classification  contractor 
select  an  appropriate  task  (a  contractor  may.  for  example, 
already  be  familiar  with  vehicles  that  have  signal  groups  with 
the  announced  fundemental  frequencies).  The  eligibility 
specification  is  blank.  The  bid  specification  Indicates  that  a 
bidder  shouid  respond  with  a  tentative  classification  and  an 
associated  confidence  measure.  This  measure  is  used  to 
select  a  classification  contractor— the  bidder  with  the 
highest  confidence  is  chosen.  The  award  Is  the  complete 
current  description  of  the  vehicle. 

A  classification  contractor  may  be  able  to  classify 
directly,  given  the  signal  group  information;  or,  on  the  other 
hand,  it  may  require  more  data,  in  which  case  It  can 
communicate  directly  with  the  appropriate  sensor  nodes 
(Figure  3.8). 
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Figure  3.8.  Classification  Contract  Communication. 

The  task  abstraction  for  a  localization  task 
announcement  (Figure  3.9)  is  a  list  of  positions  of  the  nodes 
that  have  detected  a  vehicle.  The  eligibility  specification  and 
bid  specification  are  plank.  The  bid  is  simply  an  affirmative 
response  to  the  announcement  and  the  contract  Is  awarded 
to  the  first  bidder,  which  does  the  required  thangulation  to 
obtain  the  position  of  the  vehicle. 
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Figure  3.9.  Localization  Contract  Initiation. 

Once  the  vehicle  has  been  localized,  it  must  be 
tracked.  This  Is  handled  by  the  vehicle  contractor  by 
entering  into  follow-up  localization  contracts  from  time  to 
time  and  using  the  results  to  update  its  vehicle  description 
(Figure  3.10).  As  an  alternative,  the  area  contractor  could 
award  separate  tracking  contracts.  The  decision  as  to  which 
method  to  use  depends  on  loading  and  communications.  If, 
for  example,  the  area  contractor  is  very  busy  with 
integration  of  data  from  many  group  contractors,  then  It 
seems  more  appropriate  to  isolate  it  from  the  additional  load 
of  tracking  contracts.  If,  on  the  other  hand,  the  area 
contractor  Is  not  overly  busy,  than  we  can  let  it  handle 
updated  vehicle  contracts,  taking  advantage  of  the  fact  that 
it  is  in  the  best  position  to  integrate  the  results  and  co¬ 
ordinate  the  efforts  of  multiple  tracking  contractors.  In  this 
example,  we  assume  that  the  management  load  would  be  too 
large  for  the  area  contractor. 
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Figure  3.10.  Tracking  Contract  Initiation. 

There  are  a  variety  of  other  issues  that  must  be 
considered  in  the  design  and  operation  of  a  real  distributed 
sensing  system.  Most  of  these  are  quite  specific  to  the  OSS 
application,  and  would  take  us  away  from  our  primary  concern 
with  the  use  of  the  contract  net  framework.  One  issue, 
however,  presents  an  interesting  example  of  the  relative 
utility  of  the  use  of  contracting  compared  to  the  use  of 
Information  messages.  Consider  the  case  of  a  vehicle  that  is 
moving  from  the  area  of  one  area  contractor  to  that  of 
another.  How  is  responsibility  for  trsckfng  the  vehicle  to  be 
transferred? 

There  are  two  possibilities.  First,  the  area  contractor 
that  is  currently  responsible  for  the  vehicle  could  send  out 
an  information  message  to  neighboring  area  contractors.  This 
message  would  serve  to  set  up  expectations  in  the 
neighboring  area  contractors  that  a  vehicle  was  about  to 
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pass  into  ona  of  their  areas.  The  processing  Is  fully 
decentralized  in  this  approach. 

Second,  the  area  contractor  currently  responsible  for 
the  vehicle  could  send  a  report  to  its  manager  (the  monitor 
node)  that  contains  the  same  information.  The  monitor  node 
could  then  award  a  new  contract  to  one  of  the  neighboring 
area  contractors  to  handle  the  vehicle  when  it  passes  out  of 
the  original  area.  This  is  a  hierarcnicai  control  approach  to 
the  problem,  it  entails  more  (centralized)  processing  by  the 
monitor  node. 

With  the  first  approach  there  is  no  guarantee  that 
another  area  contractor  win  pick  up  on  the  Information 
massage  and  immediately  take  responsibility  for  the  vehicle, 
where  the  second  approach  does  provide  such  a  guarantee. 
The  lack  of  guarantee  is  not  generally  serious,  but  may  result 
in  excess  processing  because  the  vehicle  must  be  re- 
detected  in  the  new  area,  classified,  localized,  and  so  on. 
The  tradeoff  is  thus  one  of  more  processing  by  the  monitor 
node  against  (possibly)  more  processing  by  nodes  in  the  new 
area.  In  the  simulation  we  have  used  the  hierarchical  control 
approach  bacausa  of  Its  guarantae  of  transfer  of 
responsibility. 


4  Conclusion 

The  use  of  the  contract  net  framework  in  the  OSS 
enables  the  implementation  of  a  dynamic  configuration, 
depending  on  the  actual  positions  of  sensor  end  processor 
nodes  and  the  esse  with  which  communication  can  be 
established.  Such  a  configuration  offers  a  significant 
operational  improvement  over  a  static  a  priori  configuration; 
specifically,  it  ensures  that  nodes  that  must  cooperate  for 
the  solution  of  the  area  surveillance  problem  are  able  to 
communicate  with  each  other.  This  avoids  the  necessity  for 
either  indirect  routing  of  messegas  or  powerful  transmitters 
for  ail  nodea. 

The  distributed  control  enabled  by  the  framework 
enhances  both  reliability  and  equalization  of  the  processing 
load.  It  is  also  possible  to  recover  from  the  failure  of  nodes, 
because  there  are  explicit  links  between  nodes  that  share 
responsibility  for  execution  of  tasks  (managers  and 
contractors).  The  failure  of  a  signal  contractor,  for  example, 
can  be  detected  by  its  manager,  and  the  contract  for  which 
it  was  responsible  can  be  re-announced  and  swarded  to 
another  node. 

The  framework  is  also  advantageous  for  this  prooiera 
because  it  enables  addition  of  new  nodes  to  the  net,  even 
after  operation  has  commenced.  This  is  possible  for  two 
reasons.  First,  the  nodas  communicate  in  a  common  protocol. 
This  protocol  enables  a  new  node  to  interpret  the  task- 
independant  portions  of  massages  (e.g.,  that  a  particular 
message  specifies  a  task  to  be  executed).  Second,  the 
protocol  is  augmented  by  a  common  mternode  language.  This 
enables  a  new  node  to  identify,  for  example,  the  specific 
information  that  it  must  have  to  execute  a  particular  task. 
The  protocol  and  language  also  enable  a  new  node  to  request 
this  information  from  other  nodes  in  the  net. 

We  have  shown  the  effectiveness  of  interactive  mutual 
decisions  (by  those  with  tasks  to  be  executed  and  those  m 
a  position  to  execute  the  tssks)  in  distributing  tasks 


throughout  the  system.  This  was  used,  for  example,  in 
setting  up  signal  contracts.  The  potential  signal  contractors 
used  the  information  in  the  signal  task  announcements  to 
select  the  closest  managers  to  wmch  to  respond.  The 
managers  in  turn  used  the  number  of  sensors  end  the 
distribution  of  sensors  in  the  returned  bids  to  make  a  final 
selection  of  signal  contractors. 

We  have  also  noted  the  ways  in  which  tha  contract  net 
protocol  helps  to  reduce  massage  traffic  and  message 
processing  overhead— through  the  use  of  task  abstractions, 
eligibility  specifications,  and  bid  apoctflcatlona  in  task 
announcements,  through  the  use  of  focused  addressing,  and 
through  the  use  of  specialized  Interactions  like  directed 
contracts  and  requests. 

The  contract  net  framework  is  in  general  applicable  to 
problems  that  use  a  hierarchy  of  tasks  and  levels  of  data 
abstraction.  The  manager-contractor  structure  provides  a 
natural  way  to  effact  hierarchical  control  (In  the  distributed 
case.  it‘s  actually  concurrent  hierarchical  control),  and  the 
managers  at  each  level  in  the  hierarchy  are  an  appropriate 
place  for  data  integration  and  abstraction.  It  should  also  be 
noted  that  the  control  hierarchies  in  the  contract  nat 
framework  are  not  simple  vertical  hierarchies,  but  are  the 
more  complex  generalized  hierarchies  discussed  by  [Simon, 
I960].  The  manager-contractor  links  are  not  the  only  means 
of  communication.  Nodes  are  able  to  communicate  horizontally 
with  ralatad-contractors  or  Indeed  any  other  nodes  in  tha 
nat,  as  we  saw  in  the  DSS  example,  where  classification 
contractors  were  able  to  communicate  directly  with  signal 
contractors. 

The  announcement  -  bid  •  award  saquanca  of  contract 
negotiation  enables  more  information,  and  more  complex 
information  to  be  transferred  in  both  directions  (between 
caller  and  respondent)  before  KS-mvocation  occurs.  The 
computation  devoted  to  the  selection  process,  based  on  tha 
information  transfer  noted  above,  is  more  extensive  and  more 
complex  than  that  used  in  traditional  approaches,  and  Is  Ixol 
in  the  sense  that  selection  is  associated  with  and  specific  to 
an  individual  KS  (rather  than  embodied  in,  say,  a  global 
evaluation  function).  As  a  result,  the  framework  is  most 
useful  when  the  specific  KS  to  be  invoked  at  any  time  is  not 
known  a  priori,  and  whan  specific  expertise  is  reaulred. 

It  also  follows  that  the  framework  is  also  primarily 
applicable  to  domains  in  which  the  subtasks  are  large  (in  the 
loose-coupling  sansa),  and  in  which  it  is  worthwhile  to 
expend  a  potentially  non-tnviai  amount  of  computation  and 
communication  to  invoke  the  best  KSs  for  each  subtask,  so 
as  to  pravent  backtracking  aa  much  as  possible. 


Appendix  A 
□S3  Semple  Massages 


This  appendix  induces  abbreviated  sample  messages 
for  the  signal  task  in  the  OSS  axampie.  For  brevity,  the 
messages  shown  contain  only  the  information  mentioned  in 
Section  2.  Terms  written  m  upper  case  are  includeo  in  the 
core  internode  language,  while  terms  written  m  lower  case 
are  soaciflc  to  the  OSS  application. 

Italicized  statements  are  commentary  about  the 
content  anb  sequence  of  messages. 
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1.  INTRODUCTION  AND  OUR  APPROACH 


There  is  a  large  class  of  applications  that 
seem  naturally  suited  to  the  use  of  distributed 
processing  hardware  for  which  adequate  distributed 
problem  solving  methodologies  have  not  yet  been 
developed.  Many  of  these  applications  occur  in 
situations  that  have  a  natural  spatial  distribu¬ 
tion  (e.g.,  where  sensors  for  collecting  raw  data 
are  widely  distributed  and/or  mobile)  where  the 
development  and  execution  of  a  distributed  control 
strategy  based  on  a  non-local  view  of  sensory  data 
is  required.  In  these  applications,  a  distributed 
architecture  which  locates  processing  capability 
at  the  sensor  site  and  which  requires  only  limited 
communication  among  processors  is  especially  ad¬ 
vantageous  and  perhaps  is  the  only  practical  way 
to  perform  the  processing.  Sensor  networks  (e.g., 
low  flying  aircraft,  hydrophones,  etc.) ,  distribu¬ 
ted  command  and  control  systems,  air  traffic  con¬ 
trol,  automotive  traffic  light  control,  inventory 
control  (e.g.,  car  rentals),  power  network  grids, 
and  tasks  involving  mobile  robots  are  all  examples 
of  this  type  of  application. 

The  central  theme  of  our  research  is  that  the 
exact  emulation  of  a  centralized  approach  to  these 
applications  in  distributed  environments  is,  in 
general,  inappropriate  due  to  the  high  communica¬ 
tion  and  synchronization  costs  required  to  main¬ 
tain  sufficient  and  consistent  local  views  of  the 
overall  problem  solving  data  base.  However,  if 
one  is  led  to  emulating  the  centralized  approach 
only  approximately,  inter-node  communication  can 
be  reduced,  but  at  the  risk  of  inconsistency  and 
incompleteness  of  local  views  and  the  possibility 
of  unnecessary,  redundant,  or  incorrect  processing. 
The  key  issue  is  whether  an  acceptable  design  ex¬ 
ists  which  has  sufficiently  low  conminication 
costs  and  which  still  allows  the  overall  system 
performance  to  be  accurate  and  efficient  enough  to 
satisfy  the  task  requirements. 

The  long  range  goal  of  our  research  is  to 
develop  a  general  theory  for  "functionally  accu¬ 
rate"  problem  solving  structures  in  which  all  in¬ 
termediate  aspects  of  the  computation  are  required 
to  be  correct  and  consistent. 

Such  a  general  theory  is  also  important  to 
the  implementation  of  complex  applications  in  cen¬ 
tralized  environments.  These  applications  are  of¬ 
ten  organized  in  the  form  of  a  collection  of  inde¬ 
pendent  modules.  In  such  a  structure  it  is  often 
conceptually  difficult  to  develop  and  expensive  to 


maintain  a  complete  and  consistent  centralized 
problem  solving  data  base  with  which  the  modules 
interact.  A  theory  which  permits  relaxation  of 
completeness  and  consistency  requirements  Mould 
be  a  significant  aid  in  the  development  and  main¬ 
tenance  of  these  systems. 

Cne  example  of  algorithmic  structures  which 
exhibit  functionally  accurate  behavior  are  those 
that  involve  the  incremental  aggregation  of  par¬ 
tial  results  into  an  overall  solution.  "Knowledge- 
based"  Artificial  Intelligence  (AI)  interpreta¬ 
tion  algorithms  developed  recently  for  speech,  im¬ 
age,  and  signal  interpretation  applications  [Erman 
and  Lesser  1975,  Barrow  and  Tenebaum  1976,  Ros en¬ 
fold,  Hummel,  and  Zucker  1976,  Rubin  and  Reddy 
19771  use  this  incremental  aggregation  approach. 
Errors  and  uncertainty  in  input  data  and  from  in¬ 
complete  or  incorrect  knowledge  are  resolved  as  an 
integral  part  of  the  Interpretation  process  by 
testing  for  inconsistency  among  partial  results. 
This  is  in  contrast  to  more  conventional  tech¬ 
niques  for  error  recovery  in  which  errors  are  han¬ 
dled  as  exceptional  conditions,  requiring  proces¬ 
sing  outside  the  normal  problem  solving  strategy.' 

The  ability  to  handle  errors  within  the  nor¬ 
mal  problem  solving  process  itself  creates  the 
possibility  that  these  systems  will  also  handle 
the  additional  uncertainty  introduced  by  a  dis¬ 
tributed  decomposition:  the  restricted  and  incon¬ 
sistent  local  view  of  the  overall  problem  solving 
data  base  at  each  node.  To  the  degree  that  sys¬ 
tems  can  handle  these  problems ,  the  need  for  a 
coapjete  and  consistent  local  view  of  the  overall 
data  base  and  for  explicit  synchronization  to 
maintain  such  a  view  is  reduced. 

Preliminary  research  in  the  development  of 
functionally  accurate  cooperative  distributed 
problem  solving  techniques  (discussed  in  Section 
3)  has  produced  encouraging  results  which  we  feel 
justify  further  explorations.  As  outlined  in  Sec¬ 
tion  4  on  current  research  directions,  we  feel 
generic  research  in  the  areas  of  distributed 
search  and  distributed  planning  will  be  important 
for  the  development  of  a  general  theory  of  func¬ 
tionally  accurate  cooperative  distributed  systems. 

2.  STATE  OF  THE  ART  AND  OUR  APPROACH 

Most  distributed  processing  systems  work  in 
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Local  data  bases  which  contain  exact  copies  of 
appropriate  portions  of  the  overall  problem  solv¬ 
ing  data  base.  We  call  this  type  of  distributed 
processing  decomposition  nearly  autonomous .  be¬ 
cause  each  processor  usually  has  the  information 
it  requires  to  complete  processing;  a  processor 
rarely  needs  the  assistance  of  another  processor 
in  carrying  out  its  problem  solving  function. 

In  nearly  autonomous  systems ,  the  problem 
solving  data  base  is  distributed  in  such  a  way 
that  the  centralized  problem  solving  algorithms 
are  not  significantly  perturbed.  Therefore,  the 
basic  algorithms  and  control  structures  which  are 
appropriate  in  a  centralized  environment  do  not 
need  to  be  modified  (decomposed)  for  the  distribu¬ 
ted  environment,  but  rather  can  be  replicated  or 
partitioned  based  on  the  distribution  of  the  prob¬ 
lem  solving  data  base. 

Where  more  complex  control  structures  have 
been  introduced  in  these  distributed  systems, 
their  use  has  been  directed  at  maintaining  data 
base  consistency  (synchronization,  deadlock  detec¬ 
tion  and  avoidance)  and  error  recovery.  This  re¬ 
sults  in  a  two-level  structure  in  which  the  more 
complex  distributed  control  structures  are  super¬ 
imposed  on  the  basic  computational  algorithms 
needed  to  perform  the  application  processing  [Es- 
wanan  ec  al  1976,  Thomas  1976,  Lamp son  and  Sturgis 
1977,  Peebles  1977], 

in  alternative  and  new  approach  to  structur¬ 
ing  distributed  problem  solving  syscasis  is  the 
development  of  functionally  accurate  cooperative 
distributed  systems.  In  cooperative  distributed 
systems,  additional  complexity  over  the  nearly- 
autonomous  structure  as  required  because  the  algo¬ 
rithms  and  control  structures  are  decomposed  to 
operate  on  local  data  bases  which  are  incomplete 
and  possibly  errorful.  In  order  to  resolve  the 
uncertainties  in  these  incomplete  local  data  bases , 
processors  must  exchange  partial  results  with  each 
ocher.  Since  new  information  nay  also  be  based  on 
processing  which  used  incomplete  or  incorrect  data, 
an  iterative,  coroutine  type  of  processor  interac¬ 
tion  i3  required.  If  a  processing  element  does 
not  receive  an  appropriate  partial  result  in  a 
given  amount  of  time,  it  must  be  able  to  continue 
processing,  utilizing  whatever  data  are  available 
at  that  time. 

A  side  effect  of  this  type  of  interaction  is 
that  local  problam  solving  tasks  in  the  coopera¬ 
tive  decomposition  are  not  disjoint  as  in  the 
nearly-autonooous  structure.  Aether,  the  tasks 
overlap  in  the  data  they  need  and  produce,  forming 
a  "cooperative  network  of  tasks"  which  collective¬ 
ly  define  the  processing  needed  to  solve  the  over¬ 
all  task.  This  perspective  shifts  the  view  from 
chat  of  a  system  distributed  over  a  network  to 
that  of  a  functionally  accurate  network  of  cooper¬ 
ating  systems,  each  of  which  can  perform  signifi¬ 
cant  local  processing  on  incomplete  and  inconsis¬ 
tent  information.  From  this  viewpoint,  the  key 
issue  becomes  how  to  introduce  new  communication 
paths  to  handle  the  inter-node  interactions  need¬ 
ed  for  sharing  the  information  reouired  to  correct 


tor  incomplete  and  inconsistent  local  views. 

This  perspective  is  essentially  identical  to 
the  theory  of  “nearly  decomposable  systems"  de¬ 
vised  by  Simon  [1962,  1969)  to  describe  complex 
organizational  structures.  The  term  "nearly  de¬ 
composable"  emphasizes  the  fact  that  systems  can 
be  decomposed  naturally  into  clusters  which  have 
a  high  degree  of  intra-cluster  interaction  and  a 
lower  degree  of  inter-cluster  interaction. 

Another  effect  of  non- localized  processing  in 
cooperative  systems  is  that  reliability  and  flexi¬ 
bility  issues  cannot  be  addressed  solely  at  the 
hardware  and  communication  level  (syntactically) 
[11,  but  must  also  be  dealt  with  as  an  integral 
part  of  the  basic  problem  solving  process  (seman¬ 
tically)  .  In  a  nearly-autonomous  system,  it  is 
easier  to  detect,  isolate,  and  recover  from  an 
error  since  propagation  of  the  error  can  be  more 
easily  determined  (due  to  the  simpler  interaction 
structure)  and  appropriate  recomputation  performed. 
This  is  in  contrast  with  cooperative  systems  where 
incorrect  partial  results  might  be  propagated  ex¬ 
tensively  through  the  system  before  detection  if 
hot  corrected  as  part  of  the  normal  problem  solv¬ 
ing  process. 

No  software  technology  for  building  coopera¬ 
tive  distributed  problem  solving  systems  which  ex¬ 
hibit  functionally  accurate  behavior  has  yet  been 
developed.  Current  methodologies  for  distributed 
applications,  such  as  distributed  data  basas,  are 
not  directly  applicable  to  cooperative  applica¬ 
tions  due  to  the  relatively  complex  interaction 
required  for  cooperative  tasks.  These  methodolo¬ 
gies  also  fail  to  deal  with  some  Issues  crucial 
to  cooperative  distributed  computation .  sucn  as 
working  with  incomplete  and  inconsistent  data. 

3.  PAST  AND  PRESENT  WORK 

Under  previous  support  from  AREA  [21  and 
current  support  from  MSF.  a  number  of  functionally 
accurate  computational  techniques  (e.g.,  relaxa¬ 
tion  and  distributed  hypothesize-and-test  ala 
Hearsay  IZ)  developed  in  the  framework  of  know- 


(11  Work  has  also  bean  performed  on  developing 
comunication  protocols  that  are  robust  In  the 
face  of  communication  and  site  failures.  [Cerf 
and  Kahn  1974.  Kimbleton  and  Schneider  1975.  Ta- 
jibnapis  1976)'  and  on  embedding  message  passing 
orotocols  into  existing  programing  languages  and 
operating  systems  [Farber  et  al  1973,  Feldman 
1977,  Fosdick,  Schantz,  and  Thomas  1977.  jeffsrson 
1977] .  When  viewed  from  the  perspective  of  the 
routing  task  alone,  some  algorithms  used  to  deter¬ 
mine  message  paths  in  a  comunication  network  have 
a  cooperative  problem  solving  character  to  them 
[Gerla  1973,  Tajibnapis  1977], 


[2!  This  was  joint  -work  with  lee  Sraan  at 
Carnegie-Meilon  University. 


ledge-based  AX  systems  at*  being  suitably  aodifisd 
for  use  in  distributed  problem  solving  environ¬ 
ments.  In  order  to  test  whether  these  modified 
techniques  have  relevance  to  distributed  problem 
solving,  s  number  of  distributed  applications 
using  these  techniques  are  being  developed. 

3.1  DISTRIBUTED  AUTOMOTIVE  TRMTIC  LIGHT  CONTROL 

One  study  has  concentrated  on  investigating 
the  suitability  of  relaxation  as  the  basis  of  a 
cooperative  distributed  approach  to  automotive 
traffic  light  control.  Parallel  relaxation  has 
been  explored  in  previous  work  in  image  processing 
but  to  our  knowledge  it  has  not  been  explored  in 
distributed  domains  which  have  ill-behaved  com¬ 
parability  relationships  between  nodes  and  non- 
localized  interactions  among  nodes  in  chs  network. 

A  test  program  was  written  which  simulates  a 
distributed  relaxation  version  of  traffic  control 
in  which  the  knowledge  applied  at  each  node  is 
similar  to  that  used  in  a  standard  centralized 
traffic  control  system  called  3IG0P-II  [Laiberman 
and  Woo  19761 .  The  simulations  have  produced  en¬ 
couraging  results  for  a  number  of  test  cases. 
However,  in  a  number  of  cases  the  parallel  schema 
does  not  perform  as  wall  as  the  centralized  ver¬ 
sion,  resulting  in  slow  convergence,  convergence 
to  near  optimal  solutions,  or  oscillation. 

These  problems  ere  directly  attributable  to 
changes  in  the  centralized  version,  a  global  node 
ordering  for  the  relaxation  is  precomputed  using 
a  maximal  spanning  tree  based  on  the  traffic  vol¬ 
ume  at  each  node.  This  ordering  is  very  Important 
in  reducing  the  effects  of  ill-behaved  comparabil¬ 
ity  relationships  between  nodes  and  non- localized 
interactions  among  nodes.  Current  research  is 
aimed  at  introducing,  in  a  distributed  way,  non¬ 
local  coordination  between  nodes  to  eliminate 
these  problems.  We  are  examining  such  techniques 
as  multi-level  relaxation  and  distributed  versions 
of  the  maximal  spanning  tree  heuristics. 

A  more  detailed  discussion  of  this  research 
is  contained  in  a  forthcoming  report  by  3 rooks 
and  lesser  11973]. 

3.2  AM  SXPE3IMEHT  IN  DISTRIBUTED  INTERPRETATION 

A  second  study  concentrated  on  applying  the 
Hearsay-IX  architecture  to  the  distributed  inter¬ 
pretation  problem,  where  each  processor  is  mobile, 
has  a  set  of  (possibly  non- 'uniform)  sensing  de¬ 
vices,  and  interacts  with  nearby  processors  through 
a  packet-radio  cooaunicatlon  network.  It  is  de¬ 
sired  that  processors  communicate  among  themselves 
to  generate  a  consistent  interpretation  of  "what 
is  happening"  in  the  environment  being  sensed. 

Our  approach  to  decomposing  Hearsay-IX  in 
the  packet-radio  network  environment  i3  based  on 
the  following  premise:  the  only  cost-effective 
way  to  decompose  Kearsay-IX  is  to  modify  the 
trobiem  solving  strategy  to  work  with  processors 


operating  on  partial  and  possibly  inconsistent 
views  of  the  current  interpretations  and  system 
state. 

There  are  two  major  ideas  necessary  to  imple¬ 
ment  the  distributed  version  of  the  Heersav-II 
architecture.  The  first  comes  from  the  relaxation 
paradigm,  where  the  current  state  ( results )  of  a 
processor  node  is  accsssed  only  by  nodes  that  are 
neighboring  in  the  node  network.  If  the  current 
state  of  the  node  is  "important"  or  "relevant", 
the  state  will  be  transmitted  gradually  to  other 
areas  of  the  node  network,  resulting  in  what  can 
be  thought  of  as  a  spreading  excitation  of  impor¬ 
tant  information.  This  same  approach  may  be  used 
to  decentralize  the  Hearsay-ll  blackboard.  Not 
all  processor  nodes  have  to  immediately  receive 
all  information  that  is  pertinent  to  then:  if  the 
information  is  really  important,  it  will  eventu¬ 
ally  be  spread  through  the  network  to  all  nodes. 

The  second  major  idea  comes  form  viewing  a 
processor  node  at  a  generator  function  which  can 
be  successively  re- invoked  as  needed  to  generate 
additional  (and  possibly  less  credible)  hypothe¬ 
ses  for  a  particular  portion  of  the  interpretation. 
Generator  function  re- invocation  is  achieved 
through  the  use  of  threshold  values  (part  of  the 
focus  of  control  data  base)  which  ere  modified  by 
the  seats  spreading  excitation  scheme  used  to 
transmit  important  information. 

Using  these  two  ideas,  we  have  developed  a 
system  which  permits  sffectiva  inter-node  cooper¬ 
ation  without  the  high  communication  bandwidth 
required  to  support  a  cosipletely  centralized  data 
base  and  controller.  This  lower  bandwidth  re¬ 
quirement  is  accomplished  by  transmitting  only  a 
limited  subset  of  the  results  generated  at  each 
node  to  only  a  small  number  of  other  processing 
nodes ■ 

A  sat  of  experiments  to  determine  how  the 
distributed  decomposition  of  Heersev-II  affects 
its  problem  solving  behavior  is  now  being  analyzed. 
Asoects  of  behavior  studied  include  the  accuracy 
of  the  interpretation,  the  time  required  for  inter¬ 
pretation,  the  amount  of  inter-node  communication 
required,  and  the  robustness  of  the  system  in  the 
face  of  communication  hardware  malfunction  (i.e., 
coammnication  without  positive  acknowledgment) . 
These  experiments  were  only  in  part  simulations, 
sines  they  used  an  actual  interpretation  system 
analyzing  real  data:  the  Hearsay-II  speech  under¬ 
standing  system  [Srman  and  lesser  19781 . 

Tentative  results  of  these  experiments  in¬ 
dicate  that  a  simplified  version  of  the  distribu¬ 
ted  Hearsay-II  architecture  as  applied  to  speech 
data  (i.e.,  different  nodes,  each  with  a  complete 
3et  of  knowledge  sources,  processing  overlapping 
segments  of  the  acoustic  data)  performs  as  well 
as  the  centralized  system.  The  distributed  sys¬ 
tem  produces  these  results  using  a  low  degree  of 
interprocessor  communication  (fewer  than  50%  of 
the  locally  generated  hypotheses  need  to  be  trans¬ 
mitted)  .  In  addition,  the  distributed  system  con- 
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tinues  to  function,  although  in  a  somewhat  degrad¬ 
ed  manner  (i.e.,  lower  recognition  accuracy),  with 
a  noisy  interprocessor  conamnication  channel  (a 
randomly  selected  2S%  of  transmitted  messages 
lost)  . 

A  forthcoming  paper  by  Lesser  and  Srman  [1978] 
further  details  this  research. 

4.  CURRENT  RESEARCH  DIRECTIONS 

Our  exploratory  research  in  the  development 
of  functionally  accurate  cooperative  distributed 
problem  solving  techniques  has  produced  promising 
results.  This  work  has  highlighted  many  key  de¬ 
sign  issues  that  roust  be  dealt  with  in  the  devel¬ 
opment  of  a  general  theory  for  functionally  accu¬ 
rate  cooperative  distributed  systems.  We  believe 
that  sufficient  intuition  has  been  obtained  in  the 
initial  stages  of  this  research  to  direct  the  in¬ 
vestigation  to  a  set  of  more  generic  research  top¬ 
ics  in  knowledge-based  AI  problem  solving  and  in 
organizational  theory.  The  results  of  this  re¬ 
search  will  be  important  in  shaping  a  general 
theory . 

Further  investigation  of  specific  applica¬ 
tions  remains  appropriate  to  the  continuing  devel¬ 
opment  of  intuitions  regarding  the  key  design  is¬ 
sues.  However,  the  selection  of  these  should  be 
done  in  a  more  directed  way,  relating  directly  to 
generic  research  topics.  The  emphasis  of  our 
current  research  is  on  an  investigation  of  the 
following  two  generic  topics. 

4.1  DISTRIBCTED  SEARCH 

Experiences  with  the  two  distributed  inter¬ 
pretation  applications  discussed  in  Section  1  has 
led  us  to  the  conjecture  that  all  functionally 
accurate  cooperative  distributed  structures  have 
at  their  heart  distributed  search.  Distributed 
search  involves  the  integration  of  partial  results 
emanating  from  multiple,  semi- independent  (discon¬ 
nected)  loci  of  search  control.  An  Adequate  model 
for  describing  and  analyzing  distributed  search 
techniques  has  not  been  developed.  This  lack  of 
a  developed  model  of  distributed  search  exists  even 
in  centralized  environments  such  as  tha  Hearsay-II 
speech  understanding  system. 

We  are  developing  a  modal  for  distributad 
search  that  provides  a  common  framework  for  addres¬ 
sing  the  following  questions: 

•What  is  the  nature  of  the  search  spaca  in  a 
given  application? 

•How  is  tha  search  space  represented  in  the 
system  (levels  of  abstraction,  data  structure, 
etc. ) ? 

•What  is  tha  nature  of  a  partial  result  in 
this  representation? 

•How  are  partial  results  extended  and  merged 


•What  is  the  nature  of  information  that  must 
be  communicated  between  semi-independent  loci  of 
search  control? 

•How  are  certain  types  of  error  resolved  as 
part  of  the  search  process? 

•How  are  the  semi- independent  loci  of  search 
control  coordinated? 

•How  is  the  search  spaca  searched  (31? 

We  are  developing  this  model  by  generalizing 
four  distributed  search  techniques:  relaxation, 
the  locus  model,  cooperating  experts,  and  hypoth- 
esize-and-test  (ala  Hearsay-II) .  we  hope  this 
model  will  have  a  taxonomic  character  that  will 
provide  a  framework  in  which  new,  alternative 
search  techniques  become  apparent.  The  model 
should  also  lead  to  the  development  of  a  small  set 
of  control  primitives  and  data  structures  wnich 
are  appropriate  for  all  types  of  distributed 
search  techniques  and  applications. 

4 . 2  DISTRIBUTED  PLANNING 

Experiences  with  the  two  distributed  inter¬ 
pretation  applications  has  also  lead  us  to  under¬ 
stand  that  distributed  focus  of  attention  is  a 
crucial  aspect  of  all  functionally  accurate  coop¬ 
erative  distributed  systems.  Distributed  focus  of 
attention  involves  the  dynamic  allocation  of  pro¬ 
cessing  power,  memory,  data,  and  communication 
capability.  We  feel  distributed  focus  of  atten¬ 
tion  is  part  of  the  larger  issue  of  distributed 
planning  (focus  of  attention  is  planning  which  is 
directed  at  one '  s  own  iamiediate  internal  proces¬ 
sing)  . 

To  approach  distributed  claiming  and  distrib¬ 
uted  focus  of  attention  in  the  same  way  as  we  ap¬ 
proach  distributed  search  (i.e.,  the  development 
of  a  generic  model)  seems  premature.  There  is 
ouch  less  experience  with  distributed  focus  of 
attention  and  distributed  planning,  and  sufficient 
intuitions  have  not  yet  been  developed.  Therefore 
a  more  directed,  experimental  approach  in  this 
area  seems  appropriate. 

We  ire  engaged  in  two  investigations  to  help 
develop  these  needed  intuitions.  The  first  inves¬ 
tigation  involves  determining  the  implications  of 
reorganizing,  for  a  distributed  environment,  two 
existing  focus  of  attention  techniques:  shortfall 
density  scoring  [Woods  1977]  and  the  Hayes-Roth 
and  Lesser  focus  of  attention  mechanism  [Hayes- 
Roth  and  Lesser  1977] .  The  shortfall  density 


[3!  Turrent  characterizations  of  search  us  me  sue.-, 
terms  as  breadth-first  and  depth-first  are  inade¬ 
quate  even  in  centralized  environments. 


scoring  technique  is  an  admissible  focus  of  atten¬ 
tion  strategy  and  the  Haves-Roth  and  Lesser  mech¬ 
anism  is  a  heuristic  strategy  based  on  a  number  of 
general  focus  of  attention  principles. 

The  second  investigation  approaches  the  more 
general  issue  of  distributed  planning.  The  analo¬ 
gy  between  most  current  planning  techniques  and 
distributed  planning  is  similar  to  that  between 
heuristic  search  and  distributed  search  (i.#.,  in 
most  planning  systems  there  is  only  one  locus  of 
control)  . 

We  are  taking  existing  AI  planning  techniques 
and  generalizing  them  to  more  than  one  locus  of 
control  in  which  the  planning  proceeds  in  a  func¬ 
tionally  accurate  cooperative  manner.  We  are  also 
taking  models  of  group  planning  developed  in  Organ¬ 
izational  Theory  and  placing  them  in  a  computation¬ 
al  framework. 

5 ■  CONCLUSION 

We'  feel  methodologies  can  be  developed  for 
functionally  accurate  cooperative  distributed  sys¬ 
tems  in  which  the  distributed  algorithms  and  con¬ 
trol  structures  function  with  both  inconsistent 
and  incomplete  data.  These  methodologies  are  ne¬ 
cessary  in  order  to  extend  the  range  of  applica¬ 
tions  that  can  be  effectively  implemented  in  dis¬ 
tributed  environments.  •«**  current  generic  re¬ 
search  studies  in  the  areas  of  distributed  search 
and  distributed  planning  as  outlined  in  the  paper 
will  aid  in  the  development  of  a  general  theory  for 
functionally  accurate  cooperative  distributed  sys¬ 
tems. 
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ABSTRACT 

’  The  principal  features  of  distri¬ 
buted  intelligent  (DI)  systems  include  a 
diversity  of  perspectives  arising  from  separated 
individuals  and  a  problem-solving  method  whereby 
the  individuals  can  coordinate  their  activities 
to  accomplish  a  collective  objective. 
These  features  require  numerous  capabilities 
and  supporting  structures.  A  system  that  embo¬ 
dies  such  capabilities,  however,  seems  especial¬ 
ly  well  suited  to  situation  assessment  (SA) 
tasks.  These  tasks  require  the  collection, 
analysis,  and  interpretation  of  widely 
disparate  data.  Moreover,  comparably  complex  and 
distributed  systems  are  often  required  to 
disseminate  the  results  ^aif  these  analyses 
effectively.  In  this  paper,  we  explore-^the  rela¬ 
tionships  among  structural  capabilities  of  DI 
systems  and  functional  requirements  of  effec¬ 
tive  SA  systems.  We—  discuss  some  of  the  op¬ 
tions  and  trade-offs  of  various  architectural 
decisions  in  the  context  of  two  contrasting 
DI  designs. 


1 .  INTRODUCTION 

Intelligent  behavior  depends  primarily  on  two 
capabilities:  first  perceiving  and  interpreting 
the  environmental  situation,  and  then  performing 
actions  that  seem  likely  to  help  attain  goals 
while  preventing  potential  undesirable  outcomes. 
Ve  cefer  to  the  first  type  of  cask  as  situation 
assessment  (SA).  The  second  general  class  of  ac¬ 
tivities  we  refer  to  as  p fanning  and  control  (PC). 
3ecause  most  human  activity  aims  to  achieve  goals 
and  avoid  risks,  SA  and  PC  tasks  are  ever  present. 

Although  the  bulk  of  previous  research  on  in¬ 
telligent  systems  has  focused  primarily  on  how  an 
individual  person  or  machine  might  accomplish  SA 
and  PC  tasks  intelligently,  we  believe  that  the 
most  significant  and  important  problems  of  human 
culture  are  approached  collectively.  We  refer  to 
systems  that  coordinate  the  activities  of  separate 
intelligent  individuals  as  distributed 
intelligence  (DI)  systems.  DI  systems  behave  in¬ 
telligently  by  coordinating  decisions  of  separated 
individuals  who  can  communicate  limited  amounts  of 
information  to  one  another. 

SA  tasks  generally  require  the  collection  of 
information  from  disparate  sensor  systems  that  may 
vary  in  their  locus,  specific  sensitivities,  or 
performance  characteristics.  Because  the  purpose 
served  by  SA  is  to  enable  an  organism  or  organiza¬ 
tion  to  plan  appropriate  behaviors,  the  important 
outputs  of  an  SA  system  often  span  a  range  of  con¬ 
cerns.  In  support  of  a  military  campaign,  for  ex¬ 
ample,  different  commanders  will  require  analyses 
that  vary  according  to  their  own  battle  management 
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responsibilities.  Both  the  data  for  and  results 
of  such  SA  tasks  are  highly  distributed  geographi¬ 
cally.  This  leads  us  to  believe  that  DI  systems 
provide  a  natural  architecture  for  SA.  This  paper 
first  examines  the  principal  features  of  SA  and  DI 
to  strengthen  this  intuition.  We  then  define  a 
large  apace  of  alternative  DI-SA  system  architec¬ 
tures,  and  consider  two  specific,  extreme  exam¬ 
ples.  The  contrast  between  these  two  suggests 
numerous  research  questions  which  are  briefly  not¬ 
ed. 


2.  THE  CONCEPT  OF  DISTRIBUTED  INTELLIGENCE 

In  our  pursuit  of  conceptualizations  of  DI 
and  their  relationship  to  SA,  we  have  found  it 
useful  to  schematize  the  core  concepts  in 
representations  akin  to  those  used  in  knowledge- 
based  systems  (cf.  Lenat,  1975,  1977;  Martin  et 
al.,  1977;  Haycs-Roth  et  al.,  1978).  Figure  1 
displays  in  outline  fora  the  concept  of  distribut¬ 
ed  intelligence  and  some  of  its  principal  facets. 
It  succinctly  describes  our  current  view  of  DI . 
In  the  remainder  of  this  section  we  elaborate  on 
some  of  the  more  interesting  and  less  obvious 
characteristics. 

2.1  Individuals 

In  our  view,  DI’s  most  interesting  property 
arises  from  the  multiple  viewpoints  or  perspec¬ 
tives  of  its  diverse  individuals.  Within  a  "per¬ 
spective  , "  we  intead  to  include  any  measurement 
system,  frame  of  reference,  or  internal  represen¬ 
tation  scheme  that  distinguishes  one  individual 
from  another.  Thus,  owing  to  their  potentially 
varying  features  or  loci,  different  individuals 
may  assess  the  same  observable  differently.  Sub¬ 
sequent  inferences  by  these  individuals  may  furth¬ 
er  diverge. 

These  differing  perspectives,  while  powerful, 
msy  be  difficult  to  reconcile.  In  areas  of  over¬ 
lapping  responsibilities,  conflicts  may  occur  as 
the  individuals  involved  each  attempt  to  interpret 
the  same  data  in  different  ways.  Resolving  these 
conflicts  can  prove  to  be  difficut  and  expensive. 
Complicating  matters  further,  the  limited  communi¬ 
cation  channels  usually  available  greatly  reduce 
the  effective  sharing  of  these  diverse  perspec¬ 
tives.  All  this  combines  to  generate  the  ill- 
founded,  "accomodating"  solutions  which  character¬ 
ize  much  of  today's  organizational  behavior. 

2.2  Organization 

W'e  identify  the  foremost  features  of  organi¬ 
zations  as  their  structure  and  government,  ootn 
of  these  directly  affect  now  veil  the  system  can 
achieve  its  purposes.  Structure  provides  1  static 
characterization  of  tne  system's  behavior,  govern- 


CONCEPT  SAME 


Distributed  Intelligence  (DI) 


DEFINITION  A  DI  system  consists  of  a  set  of  individuals  who 

cooperate  in  solving  a  problem  and  who  necessarily 
have  different  perspectives  (owing  to  different  data, 
limited  communications ,  etc.) 

EXAMPLES 

COMMERCIAL  committees,  businesses,  trucking,  economies 

COMMUNICATIVE  telephones,  stock  market,  banking  systems 

ANATOMICAL  embryo,  body,  nervous  system,  visual  pathway 

SURVEILLANCE  infectious  disease  control,  DIA,  CIA,  NSA,  fusion  centers 
PHYSICAL  spider  web,  football  teams,  armies 

INDIVIDUALS 

FEATURES  sensors,  knowledge,  memory,  processing,  transmission 

NICHE  environmental  resources,  demands,  opportunities 

LOCUS  space,  time,  culture 

PERSPECTIVE  How  does  what  individuals  sense/ know/ transmit  differ? 
FUNCTIONS  notice,  interpret,  seek  information 

negotiate,  confirm,  repudiate,  and  elaborate  decisions 
store,  forget,  transmit 

THINK-TALK  RATIO  computation  rate/ communication  rate 

ORGANIZATION 

STRUCTURE  Whose  resources/perspective  does. an  individual  share? 

communication  network;  shared  communication  contexts 
GOVERNMENT  How  are  authority/ responsibility  allocated? 

policies  and  adjudication 
power  and  enforcement 

CONNECTEDNESS  average  number  of  individuals  each  individual  talks  to 
COMMUNICATION-EFFICIENCY 

proportion  of  communication  requests  serviceable  without 
delaying  any  individual 
COOPERATION-EFFICIENCY 

communication-efficiency  /  think-talk  ratio 


INTELLIGENT  BEHAVIOR 

PERCEPTION  recognizing  instances  of  generic  categories 

ATTENTION  focusing  on  perishable, important  issues 

DEDUCTION  logical  reasoning 

INTERPRETATION  modeling  purposes  and  intentions 

INDUCTION  plausible  reasoning,  modifying  knowledge 

PREDICTION  generating  expectations 

GOAL  SETTING  formulating  subproblems  and  objectives 

SEEKING  searching  for  needed  information 

ANSWERING  replying  to  others'  questions 

LANGUAGE  cooperative  communication  conventions 

abstractions  and  symbols 

KNOWLEDGE  world  models,  multiple  abstractions 

MEMORY  storing,  retrieving  information 

EMERGENT  PROPERTIES 

DEFINITION  unpredicted  macroscopic  system  behavior 

EXAMPLES  evolution  through  natural  selection 

centralization  of  authority  in  crises 
INFLUENCES  system  complexity 

environmental  forces 
organizational  adaptability 
CONSEQUENCES  potentially  very  significant 


Figure  1.  A  conceptual  description  of  DI. 


meat  its  dynamic  aspects . 


Our  view  of  organizational,  structure  differs 
from  conventional  ones  in  its  emphasis  on  informa¬ 
tion  processing  as  the  organizing  principle.  This 
special  focus  ignores  other  sources  of  organizing 
influence,  such  as  institutional  forces  or  the  re¬ 
quirements  of  production,  distribution,  sales, 
etc.  Within  this  pure  information  processing 
framework,  many  interesting  issues  arise. 

One  structure  gives  immediate  or  proximate 
access  to  all  parties  interested  in  comparable 
data  (.by  associating  multiple  individuals  with  a 
common  memory',  for  example).  More  generally,  a 
system  can  facilitate  communication  by  arranging 
for  different  individuals  to  share  perspectives. 
This  in  turn  creates  opportunities  for  related  in¬ 
dividuals  to  encode  their  messages  efficiently  in 
ways  that  each  other  can  decode,  e.g.,  through  the 
use  of  jargon. 

We  define  the  government  of  a  01  system  as 
the  set  of  processes  which  controls  the  interac¬ 
tion  among  individuals.  To  define  a  government  one 
decides  which  individuals  are  responsible  for  what 
actions  and  which  have  authority  to  modify  or  con¬ 
trol  the  behavior  of  others.  Implementing  a  go¬ 
vernment  is  then  accomplished  by  giving  power  to 
those  with  authority  and  compelling  them  to  en¬ 
force  the  governmental  policies. 

Unfortunately,  numerous  simple  policy  objec¬ 
tives  (such  as  "fair”  elections)  are  not  achiev¬ 
able  by  DI  systems  even  in  theory  (cf.  Arrow, 
1951).  Empirically,  moreover,  it  seems  that  very 
few  21  systems  even  correctly  implement  their  ex¬ 
plicit  policies.  Even  if  there  were  no  theoreti¬ 
cal  obstacles,  the  problems  of  controlling  compet¬ 
ing  and  cooperating  individuals  in  their  pursuit 
of  multiple,  sometimes  incompatible  objectives 
exceeds  the  capabilities  of  most  DI  systems.  As  a 
manifestation  of  this  limitation,  many  adaptive 
systems  exhibit  great  flexibility  in  the  execution 
of  governmental  functions.  Vague  policies,  his¬ 
torical  precedents  and  opportunistic  decisionmak¬ 
ing  often  achieve  governmental  robustness  and 
flexibility  where  explicit  policy  implementations 
could  not. 

2.3  Intelligent  Behavior 

Presumably  DI  systems  exhibit  intelligent 
behavior.  What  exactly  do  we  mean  by  intelli¬ 
gence?  Without  attempting  to  formalize  a  defini¬ 
tion,  we  can  usefully  identify  a  number  of  capa¬ 
bilities  that  intelligent  systems  possess.  Most 
of  these  occur  in  all  DI  systems,  and  we  antici¬ 
pate  that  they  would  be  equally  useful  in  new  ap¬ 
plications  of  DI  to  SA  tasks.  The  aspects  of  in¬ 
telligent  behavior  listed  in  Figure  1  provide  a 
good  overview  of  these  capabilities .  We  will  con¬ 
sider  a  few  of  these  in  more  detail  in  this  sec¬ 
tion. 

The  first  class  of  capabilities,  including 
perception,  attention,  deduction  and  interpreta¬ 
tion,  might  best  be  called  model-based  reasoning . 


This  type  of  reasoning  compares  sensory  data  with 
internal  models  or  expectations  to  test  hypotheses 
about  the  source  and  meaning  of  the  observables. 
To  achieve  efficiency,  most  intelligent  systems 
focus  their  attention  on  important  events  or  lata 
by  allocating  scarce  processing  resources  in  ac¬ 
cordance  with  the  expected  significance  of  the 
processing  outcome.  One  basic  process  is  used 
both  to  perceive  objects  and  interpret  events. 
Matching  processes  identify  the  most  likely  cause 
of  observed  data  from  among  a  potentially  large 
set  of  stored  pattern  templates  or  generative 
models.  This  stored  set  of  patterns  comprises 
the  bulk  of  the  system's  declarative  knowledge. 
The  matching  processes  themselves  exploit  primari¬ 
ly  simple  deductive  schemes,  though  inductive  rea¬ 
soning  is  often  necessary  to  assess  the  credibili¬ 
ty  of  alternative  hypotheses. 

The  second  general  category  of  capabilities 
might  best  be  considered  planning.  This  category 
would  include  induction,  prediction,  goal  setting, 
and  information  seeking  skills.  Basically,  these 
skills  support  goal-directed  reasoning.  Inductive 
methods  enable  the  system  to  infer  likely  outcomes 
of  current  or  planned  events.  These  predictions 
can  be  used  to  evaluate  prospective  alternatives 
or,  in  a  reverse  way,  to  search  for  feasible 
methods  of  achieving  desired  goals.  Often,  of 
course,  no  immediate  plan  can  be  found  to  achieve 
some  goal,  because  additional  knowledge  or  infor¬ 
mation  is  required.  In  this  case,  the  intelligent 
system  sets  appropriate  information-seeking  goals 
and  plans  methods  to  satisfy  these  subordinate 
goals. 

The  third  class  of  intelligent  capacities  we 
identify  as  knowledge  management.  These  concern 
the  storage,  retrieval,  and  communication  of 
knowledge.  The  diversity  of  perspectives  in  01 
systems  create  special  problems  for  knowledge 
management.  Frequently,  a  system  "knows"  some¬ 
thing  but  is  unable  to  use  that  knowledge  because 
the  relevant  individual  can't  find  it.  In 
response  to  such  problems,  many  DI  systems  develop 
schemes  for  partitioning  knowledge  in  arbitrary 
ways.  An  interested  individual  might  ask  someone 
who,  in  turn,  knows  who  to  ask  to  find  the 
knowledge.  These  retrieval  schemes  often 
correspond  to  hierarchical  (associative)  indexes. 
Alternatively,  many  systems  develop  knowledge 
management  specialists  to  store  and  index 
knowledge  centrally,  complete  with  their  own 
languages  and  communication  protocols. 

2-d.  Emergent  Properties 

DI  systems  provide  an  unparalleled  opportuni¬ 
ty  for  the  study  of  emergent  properties.  This 
term  refers  to  macroscopic  behavioral  characteris¬ 
tics  of  a  system  that  appears  as  a  surprising 
consequence  of  individual  features.  The  complexi¬ 
ty  of  DI  systems  fosters  the  development  of  sucn 
emergent  properties.  Just  as  most  government ji 
planners  learn  that  nearly  all  policies  have  unin¬ 
tended  but  significant  consequences,  so  -ill  most 
DI  systems  manifest  interesting  and  potentially 
important  macroscopic  properties. 
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Emergent  properties  tan  affect  DI  sysems  in 
many  -ays.  They  can  directly  afreet  the  system's 
capability  to  perform  well  in  its  environmental 
mcne.  Some  emergent  properties  le.g.,  competi¬ 
tion  for  limited  resources)  Lead  to  adaptive 
consequences,  while  others  (.e.g.,  information 
gluts)  lead  to  undesirable  ones.  They  can  also 
affect  the  organization  indirectly.  DI  systems 
often  react  to  their  own  emergent  properties  by 
actively  trying  to  assess,  exploit,  and  control 
them.  Such  practices  might  be  considered  open- 
loop  feedback  control.  However,  this  type  of  con¬ 
trol  is  quite  difficult  to  achieve.  By  their  very 
nature,  emergent  properties  are  surprising  and  may 
not  be  easily  detected  or  correctly  interpreted. 
Attempts  to  perturb  them  may  well  have  their  own 
unexpected  results,  with  total  confusion  the  out¬ 
come.  nevertheless,  it  seems  clear  that  purposive 
DI  systems  need  to  address  this  type  of  assessment 
and  control  problem.  Otherwise,  there  may  be  lit¬ 
tle  assurance  that  the  syscem  will  continue  to 
achieve  its  objectives  as  the  environmental  condi¬ 
tions  vary. 


3.  THE  CONCEPT  OF  SITUATION  ASSESSMENT 

The  concept  of  situation  assessment  and  its 
principal  facets  are  schematized  in  outline  form 
in  Figure  2.  In  the  following  subsections,  we 
briefly  consider  some  of  the  important  features  of 
the  SA  task. 

3.1  Hypotheses  and  Tasks 

In  most  complex  environments,  the  primary 
output  of  SA  consists  of  a  set  of  hypotheses  con¬ 
cerning  the  identity,  location,  capabilities,  and 
intentions  of  the  monitored  elements.  These  will 
usually  be  multiple  hypotheses  corresponding  to 
any  set  of  sensor  reports,  since  observations  are 
generally  incomplete  and/or  errorful.  The  general 
strategy  for  resolving  uncertain  hypotheses  is  to 
find  additional  information  to  confirm  the  valid 
hypotheses  3nd  disconfirm  Che  erroneous  alterna¬ 
tives.  Frequently,  this  information  is  not  im¬ 
mediately  accessible.  Somehow  the  system  should 
acquire  the  needed  data  in  a  timely  fashion  and 
then  route  it  automatically  to  the  relevant 
analyst  who  knows  why  it  was  gathered  and  how  to 
utilize  it  efficiently  for  its  intended  purpose. 

3.2  Cooperating  Sources  of  Knowledge 

SA  typically  requires  applying  diverse 
sources  of  knowledge  to  the  interpretation  prob¬ 
lem.  This  diversity  can  reflect  separation  of 
sensor  observations  in  time  or  space  or,  more  gen¬ 
erally,  different  types  of  interpretative  exper¬ 
tise.  Among  these  types  of  expertise  we  frequent¬ 
ly  require  knowledge  about  how  sensors  respond 
differentially  to  potential  events,  how  the  en¬ 
vironment  affects  both  the  sensors  and  the  moni¬ 
tored  elements,  and  how  the  monitored  elements  are 
expected  to  behave.  The  knowledge,  in  each  case, 
can  be  considered  a  mode  1  of  behavior.  Thus,  SA 
incorporates  sensor  models,  environmental  models, 
ur.d  force  models. 


For  military  applications,  force  models  are 
particularly  important.  The  foremost  objective  in 
this  type  of  application  is  to  understand  what  the 
opposing  forces  intend  to  Jo  and  are  capable  of 
doiog.  To  achieve  this  type  of  understanding,  we 
need  to  construct  an  interpretation  of  their  force 
disposition  and  activities.  Our  interpretations 
necessarily  combine  our  diverse  sources  of 
knowledge  with  current  sensor  reports  to  attribute 
intentionality ,  capability,  and  specific  locations 
to  the  various  elements  of  their  forces.  To  the 
extent  that  our  interpretive  methods  and  support¬ 
ing  data  are  conclusive,  our  interpretations  are 
valid.  Conversely,  if  our  methods  and  data  are 
inconclusive,  our  hypothetical  interpretations  are 
necessarily  uncertain. 

3.3  Inference  Processes 

We  have  already  suggested  that  the  basic 
method  of  SA  is  hypothesize  and  test.  This  label 
identifies  a  particular  paradigm  of  problem¬ 
solving.  To  apply  it  Co  SA  tasks  requires  that 
the  diverse  types  of  knowledge  can  contribute 
directly  to  the  problem  ot"  mapping  between  observ¬ 
able  data  and  various  intermediate  levels  of  in¬ 
terpretations.  Thus,  we  use  data  from  an  earlier 
SA  as  well  as  current  sensor  reports  to  suggest 
plausible  hypotheses  about  how  the  current  situa¬ 
tion  should  be  viewed.  These  new  hypotheses,  in 
turn,  require  confirmation.  Supporting  evidence 
is  used  to  strengthen  the  credibility  of  hy¬ 
potheses,  and  contradictory  data  is  used  to  elim¬ 
inate  alternative  hypotheses  from  consideration. 

Two  principal  problems  occur  in  SA:  too  much 
raw  data  and  too  little  high-level  interpretation. 
These  problems  are  connected  intimately.  If  we 
had  fewer  sensor  reports  to  evaluate,  we  could  at 
least  subject  them  to  extensive  analyses.  On  the 
other  hand,  the  usual  reason  we  collect  so  much 
information  is  to  provide  additional  means  of  el¬ 
iminating  uncertainty  from  our  resulting  hy¬ 
potheses.  Thus,  the  core  problem  is  to  determine 
which  hypotheses  are  most  critical  and  which  data 
directly  contribute  to  their  development  and 
evaluation. 

Several  factors  are  involved  in  deciding  how¬ 
to  allocate  limited  analysis  resources  in  complex 
SA  tasks.  First,  many  data  need  not  be  attended 
to  at  all;  these  are  data  that  are  completely 
consistent  with  reliable  predictions  about  what 
the  sensors  will  see  and  report.  Second,  in  most 
cases,  our  own  intentions  significantly  affect 
what  data  we  need  to  collect  and  interpret.  Just 
as  a  runner  generally  needs  only  to  look  directly 
ahead  for  obstacles  and  dangers,  so  do  most  intel¬ 
ligent  organizations  have  specific  missions  to  ac¬ 
complish  that  can  focus  their  SA  effectively.  Fi¬ 
nally,  many  SA  problems  can  be  decomposed  into 
separable  tastes.  In  many  cases,  the  separable 
tasks  concern  different  spatial  or  Functional 
areas  of  assessment,  3nd  these  naturally  suggest 
parallel  computations  by  separate  components.  Al¬ 
ternatively,  even  when  parallelism  is  not  ap¬ 
propriate,  SA  subtasks  often  exhibit  especiaiiv 
etficient  natural  orderings.  For  example,  v e r - 
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CONCEPT  NAME 


Situation  Assessment  (SAJ 


DEFINITION  SA  tasks  involve  interpreting  and  predicting 

the  behavior  of  a  set  of  elements  chat  can 
affect  selected  aspects  of  the  environment. 

EXAMPLES 

BATTLEFIELD  status  of  forces,  rules  of  engagement,  tactics,  objectives 
DISEASE  CONTROL  type,  locus,  virulence,  resistance,  populations  of  pathogens 
INVENTORY  current  and  projected  stock  needs 

ECONOMIC  FORECASTING 

prices,  wages,  inflation,  money  supply,  inventory 
AIR  DEFENSE  status  of  own  and  other  air  farces, 

OUTPUTS 
HYPOTHESES 
TASKS 

SENSORS 

CAPABILITIES 
MODELS 
REQUIREMENTS 

KNOWLEDGE  SOURCES 
SENSOR  REPORTS 

ENVIRONMENT  geography,  terrain,  weather 
FORCE  MODELS  Force  elements:  tanks,  RP'.'s ,  etc. 

Force  structure 

Force  applications:  what  can  they  do,  how? 

Force  status:  where  are  they? 

Force  intentions: 

what  purposes  are  they  seeking? 
how  would  they  achieve  that? 

Vulnerabilities: 

how  are  we  vulnerable? 

what  activities  of  theirs  should  deserve  our  attention? 
PRIOR  SA  expect  the  future  to  be  much  like  the  past 


what,  when,  where,  why,  how,  how  much? 
what  additional  information  is  needed,  where 
can  it  be  obtained,  and  who  should  get  it? 

where,  what,  when,  accuracy,  noise 
generative  models  of  their  behavior 
power,  communications ,  computation,  terrain 


INFERENCE  PROCESSES 
METHOD  hypothesize  and  test 

KNOWLEDGE  REPRESENTATION 


EXPECTATIONS 

CONFLATION 

DATA  GLUT 

REDUCTION 

CRITICALITY 

PARALLELISM 

SERIA1ITY 


multilevel  models  mapping  intentions  into  observables 
use  prior  SA  and  knowledge  sources  to  predct  events 
combine  sensor  reports  with  knowledge-based 
predictions  to  validate  critical  hypotheses 
too  much  low-level  data — which  to  store,  which  to  discard 
perishability  and  criticality 
remove  expectable  data  from  consideration 
focus  attention  on  surprising,  important  data 
geographically  disparate  events  are  nearly 
independent  and  can  be  pursued  in  parallel 
gross  assessments  should  precede  detailed  ones 


SECURITY 

VULNERABILITY  what  can  be  revealed,  what  can't? 
COMMUNICATIONS  who  threatens  your  communications  and  how? 


RISK  CONTROL 

SURPRISE  recognizing  when  expectations  are  violated 

generating  new  models  of  behavior 
RESOURCE  ALLOCATION 

some  resources  to  interpretation;  some  for  surprises 


Figure  2.  A  conceptual  description  of  SA. 
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ill  gross  assessments  can  sometimes  obviate 
numerous  alternative  fine-grained  considerations. 

3. a  Security  and  Risk  Control 

An  organization  established  to  accomplish  SA 
often  faces  two  different  types  of  threats.  Their 
ability  to  monitor  the  forces  of  interest  may  be 
curtailed  or  subverted.  Furthermore,  they  may  be 
made  foolish  by  surprise.  Both  of  these  threats 
significantly  affect  the  design  of  SA  systems. 

The  threat  to  security  comes  primarily  from 
the  fact  that  an  intelligent  opposition  can  thwart 
one’s  ability  to  collect  or  communicate  important 
information.  Threats  like  this  cause  SA  systems 
to  perform  their  functions  under  the  protection  of 
both  physical  and  communications  security  (through 
encryption,  etc.).  Once  the  opposition  knows  what 
can  be  observed  and  how  it  is  processed,  it  can 
often  hide  its  force  elements,  modify  or  disguise 
its  emissions,  or  capture  or  occlude  sensors. 

Surprise  often  plays  a  significant  role  in 
military  campaigns.  Because  SA  is  inherently  an 
interpretive  process,  it  exploits  prior  knowledge 
to  generate  expectations  and  comprehend  the  ob¬ 
served  data.  In  this  way,  knowledge  simultaneous¬ 
ly  provides  the  basis  for  understanding  and  the 
basis  for  surprise.  An  opposing  force  that 
behaves  in  an  unforeseen  or  incomprehensible  way 
can  befuddle  the  SA  organization.  Our  view  of 
surprise  is  complementary  to  that  of 
interpretation--that  surprising  events  can  be  mon¬ 
itored  by  their  contrast  with  expected  phenomena. 
This  mechanism  at  least  provides  a  clue  to  the 
lata  that  deserve  (or  demand)  further  attention. 
If  surprising  data  are  left  unattended,  the  SA 
system  cannot  control  the  associated  risks.  To 
accomplish  its  interpretation  task  as  well  as  its 
risk  control  mission,  it  appears  that  the  SA  or¬ 
ganization  must  divide  its  resources  between  m- 
terpretable  and  uninterpretable  phenomena. 


a.  01  ARCHITECTURES  FOR  SA 

Our  current  research  project  is  aimed  at  the 
goal  of  developing  and  refining  architectural 
principles  for  OI-SA  systems.  In  this  work,  the 
concept  of  SA  just  discussed  is  being  elaborated 
to  compare  and  contrast  it  with  that  of  DI.  We 
have  defined  3  testbed  SA  problem  suitable  for  ex¬ 
perimentally  investigating  various  01  architec¬ 
tures.  3eiow,  we  describe  this  problem  and  two 
extreme  structures. 

The  Benchmark  Problem 

In  a  full-scale  battlefield  SA  problem,  we 
might  be  concerned  with  monitoring  locations  and 
movements  of  air  forces,  infantry,  artillery,  and 
armored  Uvisions,  as  well  is  predicting  their 
likely  objectives,  routes,  and  performance  capa¬ 
bilities.  3ecause  this  problem  is  exceedingly 
complex,  we  prefer  to  address  a  drastically  sim¬ 
plified  class  of  problems  initially. 


The  principal  features  of  the  r-ch  SA  problem 
should  be  preserved  in  the  bencnmark  problem,  but 
their  complexity  should  be  parameterized  and  at¬ 
tenuated.  .Among  these  features  we  include:  (1) 
multiple  elements  to  be  interpreted  at  multiple 
levels  of  abstraction;  (2)  the  elements  should  be 
distributed  spatially  and  should  move  through 
space;  (3)  multiple  sources  of  knowledge  should  be 
able  to  provide  constraint;  and  (4)  individual 
sensors  should  be  inadequate  in  scope  or  informa¬ 
tiveness  to  solve  significant  portions  of  the  SA 
problem,  and  (S)  the  value  of  their  data  should 
depend  on  timeliness.  Furthermore,  the  benchmark 
problem  should  not  require  classified  information 
or  military  expertise.  We  intend  to  employ  hy¬ 
brid,  man-machine  simulations  of  Dl-SA  systems,  so 
the  problem  should  be  one  that  can  be  readily  un¬ 
derstood  and  solved  by  ordinary  people  with  little 
formal  training. 

Currently,  we  are  considering  the  following 
SA  benchmark  task,  and  it  will  be  used  to  contrast 
the  two  alternative  architectures.  The  task  is 
called  the  message  puzzie .  It  is  outlined  m  Fig¬ 
ure  3, and  illustrated  in  Figure  4. 

Several  features  of  this  task  are  worth  not¬ 
ing.  First,  it  readily  manifests  the  four  princi¬ 
pal  characteristics  we  desired  in  our  benchmark; 

(1)  the  elements  need  to  be  interpreted  at  the 
letter,  word,  and  message  levels  of  abstraction; 

(2)  the  letters  are  spatially  distributed  and  move 
through  space  in  formations;  (3)  knowledge  about 
the  board,  the  clues  (if  any),  and  the  language 
(lexicon,  syntax,  semantics)  should  provide  multi¬ 
ple  sources  of  constraint;  and  (4)  no  individual 
sensor  will  provide  a  significant  amount  of  the 
needed  information  at  any  one  time,  and  (5)  the 
data  themselves  will  be  of  perishable  value.  In 
addition  to  its  satisfying  these  foremost  objec¬ 
tives,  the  proposed  benchmark  problem  also  exhi¬ 
bits  several  other  desirable  properties.  (6)  The 
granularity  of  the  sensors  and  their  level  of 
abstractions  can  be  readily  changed.  If  we 
wished,  we  could  use  more  sensors  and  have  them 
report  only  grjphemic  parts  of  letters  thus  neces¬ 
sitating  cooperative  analysis  and  interpretation 
of  each  letter.  (?)  The  complexity  of  the  overall 
SA  task  can  be  modulated  at  will.  The  parameters 
that  we  may  vary  include:  the  scope  and  accuracy 
of  the  sensors;  the  size  and  difficulty  of  the 
puzzle:  the  observability,  identifiabilitv ,  speed 
and  coherence  of  the  (word)  formations;  and  the 
expectability  or  surprise  value  of  the  actual  mes¬ 
sages.  (8)  The  SA  organization  can  be  evaluated 
for  its  ability  to  recognize  different  pieces  at 
different  locations  or  at  different  levels  of 
abstraction.  (9)  Each  additional  correct  in¬ 
terpretation  simplifies  further  decision  maxing; 
each  additional  incorrect  hypothesis  complicates 
subsequent  interpretation  decisions. 

With  this  benchmark  problem  m  mind,  we  new 
consider  two  hypothetical  3I-S.4  architectures. 

-.2  The  Dynamic  Hierarchical  Cone 

The  first  urcmtecture  is  a  variant  of  so- 


CONCEPT  NAME 


Message  Puzzle 


DEFINITION  A  situation  assessment  task  where  the  elements  to  be  interpreted 

include  letters  and  words  moving  toward  meaningful  t spatial  and 
semantic)  objectives.  The  world  consists  of  a  crossword  puzzle 
board.  The  words  to  fill  Che  open  slots  move  independently  across 
the  board  until  they  reach  their  target  slots,  at  which  point  they 
stop.  The  SA  task  is  to  guess  as  many  of  the  fillers  as  possible 
before  they  arrive  at  their  slots. 

OUTPUTS 

HYPOTHESES  what  messages  are  the  letters  forming? 

TASKS  what  additional  information  is  needed,  where 

can  it  be  obtained,  and  who  should  get  it? 

SENSORS 

CAPABILITIES  can  see  a  small  block  of  squares,  can  report  the  letters 
in  them,  with  various  degrees  of  latency  and  certainty 
MODELS  rules  regarding  how  the  sensors  perform 

REQUIREMENTS  the  sensors  may  be  limited  by  load  and  comm,  constraints 

KNOWLEDGE  SOURCES 
SENSOR  REPORTS 

ENVIRONMENT  the  puzzle  board  and  some  knowledge  of 

target  characteristics  (eg.,  categories) 

FORCE  MODELS  Force  elements:  letters,  words,  phrases 

Force  structure:  syntax,  semantics,  crossings 
Force  applications :  the  letter  sequences  move 

snake-like  through  the  spaces  toward  destinations 
Force  status:  where  are  they?. 

Force  intentions:  what  targecs  and  meanings  are  they  seeking? 
Vulnerabilities:  which  targets  have  highest  payoffs? 

Figure  1.  A  partial  description  of  the  benchmark  SA  task. 
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called  "perceptual  cooes"  let.  Uhr,  1978;  Tanimoto 
and  Pavlidis,  1975).  Where  ordinary  perceptual 
cooes  define  a  static  hierarchical  structure  among 
processing  individuals  at  different  levels  of 
aostraction,  we  propose  to  link  lower  individuals 
to  higher  ones  in  dynamic  ways.  Specifically,  a 
lower  level  individual  that  has  important  informa¬ 
tion  to  report  requests  a  superior  to  receive  it. 
The  higner- level  superiors  acknowledge  those  re¬ 
quests  that  appear  most  significant  and  timely. 
Thus  the  superior-subordinate  mapping  from  one 
level  to  the  next  may  vary  over  time. 

For  simplicity,  assume  an  8  x  8  puzzle  board 
and  a  three- level  cone  with  one,  four,  and  sixteen 
individuals  at  the  various  levels,  as  shown  in 
Figure  w.  The  lowest-level  individuals  can  "see" 
a  block  of  four  squares  in  that  they  receive  (pos¬ 
sibly  erroriui)  reports  describing  their  contents. 
All  communication  is  vertical,  either  from  a 
subordinate  to  a  superior  or  vice  versa.  Each  in¬ 
dividual  can  send  messages  to  as  many  as  four  in- 
lividuals  and  can  receive  up  to  four  messages  per 
time  step.  If  more  than  four  messages  are  sent  to 
an  individual  in  any  time  step,  randomly  selected 
ones  are  lost  until  just  four  arrive.  Each  indivi¬ 
dual  has  unlimited  computing  and  storage  capaci¬ 
ties,  for  ail  practical  purposes.  The  capacity  of 
one  message  is  limited  to  four  hypothesized  cell 
concents  or  a  scring  message  hypothesis  plus 
corresponding  instructions  (e.g.,  "expect,"  "re¬ 
ject,"  "verify").  Some  simple  credibility  measure 
at  the  hypotheses  is  also  permitted.  Additional 
simple  messages  may  be  communicated  to  control  the 
behavior  of  subordinates,  such  as  "quiesce  for  2 
time  steps”  or  "sleep  until  awakened." 

At  the  outset,  the  board  is  partially  filled 
with  ietter  strings,  and  some  of  these  form  par¬ 
tial  messages.  The  low-ievel  (level  3)  individu¬ 
als  receive  reports  on  their  2x2  windows.  Some 
of  these  are  more  informative  than  others.  For 
instance,  the  letter  sequence  "kw"  which  might 
suggest  "awkward"  but  only  a  few  alternative 
words.  In  response  to  the  "kw”  data,  this  indivi¬ 
dual  might  decide  to  sead  one  message  to  all  supe¬ 
riors  that  predicted  "awkward"  in  the  appropriate 
positions . 

The  immediate  problems  that  arise  from  this 
simple  example  include  the  following.  The  indivi¬ 
duals  don't  know  exactly  which  superiors  are  tak¬ 
ing  responsibility  for  processing  their  hy¬ 
potheses.  Although  this  means  that  the  potential¬ 
ly  wasted  capacity  of  static  cones  has  been  avoid¬ 
ed,  it  also  means  that  time  and  effort  will  be 
spent  deciding  which  subordinates  and  superiors  to 
couple.  If  individuals  are  allowed  Co  play 
multi-level  roles  (thereby  reducing  the  total 
numoer  of  individuals  while  retaining  a  hierarchi¬ 
cal  structure),  the  problem  of  who  to  talk  to  is 
further  compounded. 

Vertical  communications  of  this  sort  have 
benefits  as  well  as  costs.  If  the  network  organ¬ 
izes  itself  into  a  static  quaternary  tree,  a  pred¬ 
iction  can  be  communicated  from  one  side  of  the 
board  to  the  other  in  two  time  steps  by  linking  ap 


and  then  down.  The  higher-level  individuals  can 
develop  plans  for  governing  the  lower- level  ones 
to  maximize  the  net  result.  Once  an  accurate 
prediction  has  been  made  and  confirmed,  it  may  be 
desirable  to  put  sensors  to  sleep  to  minimize  com¬ 
munication  contention.  Collective  wisdom  can  thus 
be  accumulated  at  higher-levels  based  on  multiple 
inputs  without  duplicating  efforts  at  many 
equivalent  sites  in  the  lower-level. 


4.3  The  Anarchic  Committee  Architecture. 

In  the  previous  architecture,  knowledge  accu¬ 
mulated  at  higher- levels  of  abstraction,  and  it 
was  natural  to  suppose  that  power  and  authoricy 
would  be  similarly  distributed.  In  contrast  we 
now  consider  a  design  in  which  no  individuals  are 
distinguisbed  by  knowledge,  authority,  or  power. 
Rather,  the  only  distinctions  that  arise  reflect 
increased  self-assumed  responsibility.  Specifi¬ 
cally,  a  single-level  network  is  proposed  wherein 
individuals  who  believe  they  have  important  infor¬ 
mation  join  with  some  of  their  peers  in  forming  a 
consensus.  Each  decision  then  results  from  the 
formation  of  a  corresponding  committee  that  as¬ 
sumes  responsibility  for  its  judgment.  Later,  if 
any  particular  decision  is  challenged,  the  commit¬ 
tee  responsible  should  be  prepared  to  defend  its 
conclusion. 

As  previously,  each  individual  has  responsi¬ 
bility  for  monitoring  a  2  X  2  block  of  squares. 
To  compensate  for  the  reduction  in  total  number  of 
individuals,  we  allow  a  higher  communication  rate. 
A  typical  scenario  might  go  as  follows.  An  indi¬ 
vidual  reads  his  sensor  report  and  decides  to  do 
one  of  four  things.  He  will  remain  silent  if  he 
has  nothing  significant  to  say  or  if,  in  his  opin¬ 
ion,  he  should  delay  communicating  temporarily. 
He  will  formulate  an  hypothesis  and  volunteer  to 
form  and  chair  a  committee  if  he  is  the  first  to 
recognize  the  possibility  of  needing  a  new  commit¬ 
tee.  If  he  belongs  to  a  committee  with  pending 
matters,  he  may  contribute  useful  information  to 
its  chairing  individual.  Finally,  if  he  is  chair¬ 
ing  a  committee  but  believes  the  matter  before  it 
is  moving  out  of  its  jurisdiction  (because  of  lim¬ 
ited  communication  range  or  departure  of  the  mes¬ 
sage  from  its  purview),  he  may  attempt  to  enlist 
another  relevant  individual  to  form  and  chair  a 
committee  on  this  message.  In  this  way,  one  com¬ 
mittee  hands  off  its  problems  to  others. 

This  architecture  manifests  very  different 
properties  than  those  exhibited  in  the  hierarchi¬ 
cal  system.  Here,  knowledge  accumulates  by  form¬ 
ing  increasingly  large  committees.  At  any  point 
in  time,  the  committee  membership  reflects  those 
individuals  that  still  believe  they  have  an  in¬ 
terest  in  pursuing  a  message.  As  the  messages 
move,  the  committees  become  staffed  with  old.  ir¬ 
relevant  members  and  i  need  arises  to  hand  of: 
committee  merabersnip.  Higher-level  analyses  em¬ 
erge  as  the  committees  either  become  lorger  or  as 
new  ones  devoteu  to  more  abstract  djta  process. nz 
are  formed. 


The  advantages  of  this  architecture  seeo  to 
derive  primarily  from  its  reduced  number  of  indi¬ 
viduals  and  Lis  simplified  communication  network. 
On  the  other  hand,  it  entails  significantly  more 
communication  than  the  hierarchical  net.  Where 
the  hierarchy  could  use  high-level  assessments 
derived  from  many  sensors  to  comprehend  regional 
patterns  and  track  them  without  hand-off,  the 
anarchic  committee  design  must  constantly  hand-off 
tracking  problems  as  pacterns  move  from  one  area 
to  the  next.  This  introduces  problems  of  choosing 
individuals  to  hand  the  tracking  problems  to  aad 
re-establishing  committee  membership.  Moreover, 
the  need  to  form  higher-level  assessments  with  the 
same  individuals  who  are  performing  other  tasks 
seems  to  complicate  the  problem  of  indexing 
knowledge.  When  the  same  individual  plays  multi¬ 
ple  roles,  he  becomes  a  potential  bottleneck  for 
the  system. 

-.4  A  Large  Space  of  Design  Alternatives 

What  our  preliminary  analyses  have  indicated 
is  a  very  large  space  of  alternative  designs  for 
Dl-SA  architectures.  To  explore  this  space  is  the 
purpose  of  our  current  research.  The  two  simple 
straw-man  architectures  just  discussed  vary  pri¬ 
marily  on  one  dimension,  that  of  communication 
network  structure.  This  variation  can  be  seen  to 
influence  many  of  the  opportunities  for  adaptive 
behavior.  In  particular,  established  hierarchies 
simplify  tracking  of  motion  and  indexing  of  oth¬ 
ers'  knowledge.  In  contrast,  they  create  require¬ 
ments  for  effective  governmental  policies.  When¬ 
ever  such  policies  are  required,  we  anticipate 
their  creating  numerous  unanticipated  emergent 
properties . 

Our  current  conceptual  descriptions  of  DI  and 
SA  provide  a  rich  sec  of  dimensions  for  varying 
possible  architectures.  We  expect  that  the  com¬ 
munication  structure  will  provide  the  best  single 
experimental  variable  for  study.  This  belief  is 
based  on  the  fact  that  many,  if  aot  most,  charac¬ 
teristics  of  a  DI  system  are  responsive  to  the 
limited  communications  paths  between  individuals. 
We  expect  to  find  that  many  significant  DI  proper¬ 
ties  are  adaptations  to,  rather  than  the  cause  of, 
their  communication  capabilities.  In  addition,  we 
believe  we  can  manipulate  this  dimension  effec¬ 
tively,  in  both  human  and  machine  simulations  of 
DI-5A. 


5.  CONCLUSIONS:  TOWARD  DESIGN  HEURISTICS  FOR  DI 
AND  SA 

Our  basic  approach  to  DI  and  SA  has  been  ex¬ 
ploratory.  We  have  attempted  to  formulate  the 
principal  features  of  each  concept  and  to  relate 
these  two  together.  At  this  point,  we  believe  ex¬ 
perimental  and  analytical  investigations  of  the 
effects  of  alternative  communication  network 
structures  on  SA  performance  would  be  appropriate. 
In  formulating  our  experimental  benchmark  problem, 
we  have  emphasized  the  significant  interpretation 
problems  of  SA  and  Jeemphasized  the  lower-level 
signal  processing  problems.  This  reflects  our  be¬ 


lief  that  DI  is  an  important  candidate  for  per¬ 
forming  the  multilevel  interpretation  problems 
that  arise  in  battle  management.  These  problems 
require  bringing  diverse  sources  of  knowledge  to 
bear  in  understanding  activities  at  different  lev¬ 
els  of  abstraction.  In  turn,  we  see  the  outputs 
of  SA  being  provided  to  different  personnel  with 
correspondingly  varied  interests  and  needs 

In  our  overall  approach,  we  have  attempted  to 
develop  a  mapping  between  DI  and  SA  concepts  to 
suggest  architectural  principles.  Our  highest 
level  research  heuristic  suggests  viewing  DX  and 
SA  in  different  ways  to  develop  alternative,  in¬ 
teresting  analytical  or  functional  analogies.  For 
example,  viewing  DI  as  a  structure  and  SA  as  a 
problem,  the  following  heuristics  apply: 

HI:  Usually,  structures  evolve  to  support  the 
needed  functions;  thus,  consider  what  func¬ 
tions  accomplish  SA  problem-solving  and  then 
develop  supporting  structures. 

H2:  Good  solutions  typically  employ  modularized 
functions . 

H3:  Good  structures  support  modularized  functions 
by  a  set  of  independent  design  features. 

K4:  Once  a  list  of  independent  design  features  is 
accumulated,  these  may  be  realized  by 
separate  physical  entities  or  by  multifunc¬ 
tion  designs. 

We  could  formulate  other  views  of  DI  and  SA 
that  would  stimulate  other  design  heuristics. 
Some  alternative  views  of  DI  systems  would  consid¬ 
er  them  principally  as  processes,  individuals  or 
populations.  Alternative  views  of  the  SA  t3Sk 
would  emphasize  its  similarity  to  pattern  recogni¬ 
tion,  to  Gestalt  perception,  or  to  an  abstract  map 
(with  insets). 

From  our  preliminary  human  experiments  and 
analyses,  we  have  developed  some  tentative  design 
concepts  and  heuristics,  which  we  list  without 
further  elaboration: 

H5 :  Individuals'  diverse  perspectives  both  pro¬ 
vide  the  constraint  needed  for  effective 
problem-solving  and  engender  the  complexity 
of  cooperative  problem-solving. 

H6:  Global  (consensual)  perspective  can  best  be 
derived  by  abstracting  local  perspectives, 
and  the  resultant  abstractions  can  provide  jn 
effective  intermediary  for  indexing  others' 
knowledge. 

H7 :  Organizational  elements  should  adapt  dif¬ 
ferentially  to  variations  in  environmental 
complexity  by  restructuring  subsets  and 
selectively  allocating  resources  to  them. 

H8:  Emergent  properties  significantly  affect  the 
behavior  of  a  DI  system,  and  can  most  effec¬ 
tively  be  studied  empirically. 
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H9:  Close  couplings  of  individuals  facilitate 
their  communication  and  provide  the  principal 
basis  for  dynamic  organization. 

H10:  Assuming  adequate  memory,  comparable  indivi¬ 
duals  should  possess  identical  capabilities. 
Individuals  should  be  differentiated  when 
different  environmental  or  organizational 
niches  imply  specialization. 

HI 1 :  Uncertainty  can  best  be  resolved  by  attending 
to  the  most  (conditionally)  diagnostic  infor¬ 
mation  at  each  point  in  time.  Gross  filters 
should  therefore  be  employed  before  fine 
ones;  these  filters  suggest  which  details  are 
worth  computing  and  which  are  not. 

Hli:  Hierarchical  networks  minimize  connunication 
in  interpretation  tasks.  However,  their  op¬ 
timal  structure  can  only  be  known  dynamical¬ 
ly,  after  the  data  are  presented. 

H12 :  Situation  assessment  can  be  significantly 
simplified  by  appropriate  use  of  expecta¬ 
tions.  Such  expectations  are  most  easily  im¬ 
plemented  by  passing  responsibility  for  them 
directly  to  individuals  who  could  locally 
confirm  or  disconfirm  them. 

H14:  SA  can  be  simplified  by  abstraction. 

HIS:  Spatially  hierarchical  nets  are  faster  than 
linear  neighbor-co-neighbor  communication 
paths . 

Hid:  The  same  individual  can  play  roles  at  dif¬ 
ferent  levels  of  abstraction,  but  this  in¬ 
creases  the  difficulty  of  knowing  which  indi¬ 
vidual  to  communicate  with. 

H16:  Centralization  is  undesirable  when  computing 
is  slow  relative  to  communication  or  communi¬ 
cation  is  difficult.  Otherwise,  centralized 
computing  is  desirable.  In  particular,  cen¬ 
tralized  committee  decisionmaking  is  good. 

HIT:  Consumer-producer  models  suggest  locating 
processing  near  supply  and  demand,  except 
when  it  is  much  cheaper  to  locate  it  else¬ 
where  (e.g.,  because  significant  economies  of 
scale  exist) . 

H18:  There  are  few  cases  where  communication  is 
pceferable  to  computing--in  research,  where 
it  is  desirable  to  avoid  rediscovering  what 
is  already  known;  or  in  any  situation  where 
expectations  obviate  computing. 
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A  sensor  Is  a  device  that  observes  physical 
phenomena.  The  output  of  a  sensor  Is  a  function  of 
these  observations.  Hera  we  are  concerned  with  the 
particular  problem  of  constructing  a  world  picture 
from  sensor  data.  The  world  of  Interest  contains 
craft  capable  of  notion  (e.g.,  airplanes,  water 
vessels,  and  land  forces)  and  the  terrain.  The 
Important  terrain  features  are  target  locations  and 
geographic  data  that  may  affect  the  operation  and 
behavior  of  sensors  and  craft.  The  world  picture 
comprises  descriptions  of  craft  behavior  and 
location  with  respect  to  the  terrain  and  to  each 
otner. 

There  are  many  families  of  sensor  devices.  Among 
them  are  pnased-array  acoustics.  Infrared,  sonar, 
and  many  kinds  of  radars.  The  major  distinguishing 
characteristic  among  different  sensor  families  is 
the  type  of  phenomena  that  they  can  observe. 

aacn  device  has  three  primary  types  of  components: 
;n  analogue  components— the  observation  collector 
such  as  a  simple  microphone  or  antenna.  Active 
sensors  also  Include  an  analogue  illuminator 
component;  for  example,  a  radar  generates  the 
signals  that  it  detects  after  reflection. 
(2)  digital  components— a  processor  that  turns  raw 
ooservatlons  Into  the  sensor's  output  such  as 
azimuth,  heading,  velocity,  position,  etc.  The 
digital  component  may  also  control  and  coordinate 
the  analogue  components.  (3)  interface 
components— accomplish  delivery  of  the  sensor's 
output  to  the  user.  This  function  often  includes 
graphic  display  and  communication  with  a  central 
facility. 

The  remainder  of  this  paper  describes  the  concept 
of  Distributed  Sensor  Networks — OSN . 

mi  >aE  raE  aflaas? 

Current  sensor  technology  naa  reaoned  a  very 
sophisticated  state.  Radars  are  sensitive  enough 
to  detect  the  differential  Doppler  shift  between 
the  top  and  bottom  of  e  tire  on  a  moving  vehicle, 
and  pnased  acoustic  arrays  can  listen  to  entire 
bceans.  However,  many  problems  remain. 

-»i«  Ci i»«r»w  Problem.  Aa  mentioned  above,  radars 
can  be  very  accurate.  On  the  ether  hand,  a  radar's 
normal-aircraft  detection  radius  la  limited  to 
about  one  hundred  kilometers;  for  low-flying 
aircraft  the  coverage  ia  much  more  limited. 
Acoustic  arrays,  however,  may  have  muon  greater 
range,  particularly  the  underseas  variety,  but  they 
are  no  rival  to  a  radar’s  accuracy.  Ho  single 
sensor  produces  optimal  coverage.  Between  sensor 


families  there  are  tradeoffs  among  area,  accuracy, 
and  sensitivity. 

The  Vulnerability  Problem.  As  one  might  guess, 
powerful  sensor  devices  are  expensive.  Therefore, 
they  are  few  in  number,  and,  in  general,  only  one 
or  a  few  cover  any  given  area.  This  creates  the 
vulnerability  problem.  If  a  single  device  fails 
(or  is  knocked  out  by  enemy  action)  a  critical 
region  may  be  left  uncovered.  Even  wnen  multiple 
devices  cover  any  given  area,  the  system  is  still 
vulnerable  if  a  centralized  facility  is  responsible 
for  data  coordination  because  the  centralized 
facility  itself  may  fail. 

The  Dae  of  Information  Prohiem.  This  is  also 
called  the  inference  problem.  In  many  instances, 
extra-sensor  Information  is  available.  For 
example,  an  intelligence  report  states  that  a 
particular  submarine  is  very  likely  to  cruise  in  a 
particular  area.  How  is  this  input  to  he  used  by  a 
sensor  to  more  accurately  determine  the  world 
picture?  For  another  example,  consider  the  problem 
of  correlating  data  from  different  sensors  that 
cover  a  common  area — say  microphones  and  a  radar. 
How  is  the  data  combined,  particularly  if  multiple 
craft  are  in  the  area? 

The  Interface  Problem.  The  output  of  sensor 
devices  can  be  used  in  a  multitude  of  ways. 
Current  practice  delivers  the  information  to  a 
human  user.  The  user  makes  decisions  based  upon 
this  output  and  is  able  to  exercise  limited  control 
over  the  functioning  of  the  sensor.  The  trend  is 
toward  more  sophisticated  systems  in  wnich  many 
obssrvations  and  control  functions  are  automated. 
At  the  point  where  the  human  user  interacts  with 
the  automated  system,  the  data  is  highly 
correlated— inferences  have  been  drawn  and  acted 
upon.  The  user  has  a  need  to  ascertain  the  method 
employed  by  the  system  to  reach  its  present  state. 
That  is,  the  system  must  be  able  to  explain  and 
dufsnrt  its  behavior.  Further,  the  system  must  be 
able  to  Initiate  dialogue  rather  than  remain 
passive  in  the  interaction,  especially  in  tne  face 
of  unusual  uncertainty  or  during  a  crisis.  This 
becomes  particularly  important  as  sensor  systems 
undertake  craft-guidance  functions. 

mu  "net  Altnougn  the  cost  of  procsssing 
( particularly  for  small  to  medium-sized  computers) 
nas  been  drastically  reduced  by  LSI  technology, 
there  has  been  no  corresponding  reduction  in  the 
cost  of  anologus  components  and  packaging  for 
sensor  devices.  At  issue  is  tne  development  of 
system  architectures  mat  take  advantage  of  the 
current  and  projected  cost  trends. 
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The  next  section  discusses  distributed  sensor 
networks,  a  possible  approach  to  overcoming  many  of 
these  problems. 


Many  of  the  problems  discussed  in  the  preceding 
section  can  be  addressed  by  using  sensor  networks. 
A  sensor  network  is  a  group  of  sensor  devices 
connected  by  a  communications  network  trying  to 
achieve  a  common  goal— deriving  an  accurate  world 
picture.  Several  tnings  need  to  be  understood: 
First,  the  area  covered  by  the  sensors  on  the 
network  is  larger  tnan  any  one  of  them  can  cover 
(at  least  cover  well).  Second,  there  are  some 
areas  that  are  covered  by  more  than  one  sensor.  If 
this  is  not  the  case,  the  sensors  can  operate  in  a 
virtually  autonomous  manner,  and  the  system  will 
neither  enjoy  all  the  benefits  nor  suffer  all  the 
problems  discussed  below.  Third,  the  number  of 
sensor  devices  in  the  system  is  not  trivially 
small . 

Problem  Scope.  We  wish  to  enlarge  the  scope  of  the 
problem,  .lamely,  the  following  types  of  non-sensor 
components  are  to  be  considered  examples  of 
potential  members  of  the  network:  (1)  Command, 
Control,  and  Communications  (C3)  systems, 

(2)  observers  with  portable  terminals,  and 

(3)  cruise  missiles— a  combination  of  a  craft  and 
a  sensor  device. 

The  rationale  far  considering  the  above  types  of 
components  is  that  the  military  is  moving  towards 
large  integrated  information-processing  systems  for 
the  1980s.  Sensor  networks  per  se  represent  only  a 
part  of  the  total  system.  Each  of  the  additional 
component  types  expands  the  demand  for  cooperation 
and  system  feedback.  Further,  they  all  represent  a 
variety  of  usages  (contexts)  for  the  sensor  system. 

In  order  to  evaluate  and  compare  different 
sensor-system  architectures,  it  is  necessary  to 
view  them  in  the  environment  of  their  applications. 
For  example,  a  system  that  produces  only  nign 
duality  output  for  critical  regions  is  probably 
more  valuable  than  a  system  that  generates  uniform 
but  mediocre  results  everywhere.  To  form  such  an 
objective  (versus  subjective)  merit  function,  it  is 
necessary  to  Include  the  extended  components  wnose 
operations  may  represent  the  payoff  of  the  entire 
system. 


and  forth  via  the  network.  Since  the  data  base 
itself  resides  only  at  a  single  location,  problems 
of  consistency  are  minimal .  The  major  problems 
with  this  mode  of  operation  are  the  delays  caused 
by  the  network  and  the  vulnerability  of  the  entire 
system  if  the  single  host  housing  the  data  base 
falls . 

Decentralization  is  a  paradigm  in  which  no  central 
global  view  is  maintained  and  system  control 
functions  are  not  centralized.  This  type  of 
organization  can  occur  on  a  single  processor  when  a 
computation  Is  performed  by  separate  but  equal 
proceasea.  Even  though  In  this  case  there  exists 
an  operating  system  that  actually  allocates 
resources  and  settles  Implicit  conflicts  among  the 
processes,  the  details  of  such  scheduling  and 
conflict  resolution  are  not  relevant  to  the  correct 
operation  of  the  processes  If  they  are  well-formed 
for  decentralized  computation.  This  scheme  is 
often  used  to  model  AX  problem  solutions  and  team 
efforts.  The  members  of  the  team  are  represented 
as  independent  experts  with  their  individual 
viewpoints  and  problem-solving  methods.  They 
cooperate  when  it  Is  appropriate,  and  work  on  their 
own  when  that  appears  best. 

Decentralization  is  also  compatible  with 
distribution  of  processing.  For  the  data  base 
example  above,  a  natural  extension  is  to  distribute 
the  data.  For  example,  different  offices  of  a 
large  corporation  maintain  local  data  bases. 
Computatlbns  that  involve  data  Troa  more  than  one 
office  use  a  communications  facility,  the  network, 
to  transfer  the  necessary  information.  In  this 
situation,  there  is  a  problem  of  consistency, 
particularly  If  external  data  or  inferences  drawn 
from  It  are  stored  for  any  length  of  time — updates 
are  not  simultaneously  (if  at  ail)  reflected 
everywhere  the  data  resides.  On  the  other  hand, 
the  decentralized  facility  provides  marked 
efficiency  gains  when  most  computations  involve 
only  local  data;  the  system  is  also  more  robust. 
Failure  of  no  subset  of  the  sites  cripples  all  the 
capabilities  of  the  entire  system.  Further,  there 
appesrs  to  be  a  cost  advantage  to  using  many  medium 
scale  processors  versus  a  single  large  processor. 
Whether  this  cost  savings  is  real  or  only  apparent 
depends  upon  the  future  of  the  hardware  component 
revolution  and  the  costs  of  communications 
mechanisms. 


Platriautlaa  and  JpscatralUafclan-  These  two 
similar-sounding  but  quite  distinct  concepts  are 
often  confused.  Distributed  processing  or 
architecture  lenotes  a  paradigm  In  which,  although 
the  computation  Is  dispersed,  a  global  (perhaps 
hierarchical)  view  and  centralized  control  are 
maintained.  A  typical  example  is  cooperative  tasks 
on  a  multiprocessor  with  a  shared  memory.  A  major 
problem  here  is  conflict  resolution  for  the  shared 
resources,  a  problem  normally  solved  explicitly  by 
synchronization  primitive  use  by  the  tasks 
themselves.  Another  class  of  distributed 
processing  occurs  with  networks.  For  example,  a 
process  on  one  machine  (a  host)  maintains  a  lata 
case.  Processes  on  other  hosts  use  the  data  base 
process  ilk#  a  subroutine,  passing  information  back 


For  a  variety  of  reasons,  it  is  obvious  that  sensor 
networks,  which  are  by  definition  distributed, 
should  also  be  decentralized.  Decentralization  is 
a  viewpoint  on  the  management  of  system  complexity. 
As  such,  it  dictates,  to  a  degree,  a  style  of 
organization  and  Interaction  among  system 
components.  Many  (but  not  necessarily  all)  of  the 
components  must  be  capable  of  autonomous  operation, 
working  alone  when  circumstances  dictate  and 
cooperating  at  other  times.  In  addition,  the 
reliadllity  and  responsiveness  of  other  components 
will  be  iouoted;  tnat  is,  a  component  should  be 
able  to  continue  functioning  in  the  face  of  system 
degradation.  From  these  considerations  flow  t.-.e 
advantages  of  a  system  that  is  both  distributed  and 
decentralized. 
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Shared  Utilization.  It  should  be  possible,  without 
reorganizing  a  DSN,  to  add  new  consumers  of  its 
output.  In  principle,  this  expansion  is  no 
different  than  expanding  the  DSN  Itself,  because  in 
both  cases  there  is  a  need  to  share  results  and 
determine  common  goals. 

Extended  Coverage.  In  a  similar  rein,  a  DSN  should 
be  able  to  employ  sensors  from  several  families. 
Components  performing  higher-level  functions  such 
as  tracking  and  threat  evaluation  are  presumably 
using  information  from  a  variety  of  sources— 
sensors,  Intelligence,  etc.  Geographically 
distributed  information  is  also  available  from  the 
network  and  can  be  used  to  enhance  the  scope  of 
information  produced  as  well  as  its  accuracy  and 
its  timeliness.  The  simplest  form  of  this  is  a 
component  in  one  region  notifying  the  component (a) 
in  another  region  of  an  approaching  craft. 

More  3fiUaBlS  Syateaa.  Because  of  decentral¬ 
ization,  the  opportunity  to  construct  very  reliable 
systems  exists.  Redundant  and  overlapping 
operation  Increases  protection  against  failure; 
centralization  has  the  opposite  effect.  Because  of 
tne  flexibility  inherent  in  a  DSN,  the  system  can 
dynamically  reorganize  itself  and  reallocate 
responsibilities  in  the  face  of  degradation  or 
specific  threat  identification. 


order  to  properly  function  at  all,  a  DSN  must  use 
protocols  that  allow  requests  for  information  at 
many  different  degrees  of  completeness  and 
accuracy.  Since  the  components  are  loosely 
coupled,  it  would  be  odd  to  impose  rigid 
communications  requirements.  It  is  exactly  this 
kind  of  protocol  that  is  needed  also  at  the  system 
Interfaces  in  order  to  explain  and  display  system 
benavlor.  Such  a  system  must  be  capable  of  drawing 
inferences  from  tne  data  It  does  receive  and  have  a 
rational  mechanism  for  determining  what  to 
communicate  and  what  data  to  request.  This  is  not 
to  say  that  decentralization  solves  these  problems, 
but  rather  that  If  a  DSN  is  to  be  constructed,  the 
problems  must  be  solved.  The  solutions  are  then 
available  for  a  variety  of  usages. 

Coat  Benefits,  a  DSN  is  by  nature  processing  and 
communication  intensive.  The  cost  of  these  items, 
particularly  processing,  is  dropping  at  a 
remarkable  rate.  It  Is  presumed  that  the  very 
expenalve  components,  e.g.,  large  radar  antennas, 
can  be  displaced  with  cheaper,  less  capable 
versions  and  sensors  from  less  costly  families. 
The  accuracy  at  Individual  sltas  will  be  redeemed 
through  the  cooperation  of  several  times  as  many 
cneaper  components  augmented  by  Intelligence  and 
sophisticated  proceaaing.  Testing  this  hypothesis 
is  one  of  the  major  goals  of  the  DSN  investigation. 

Mew  Problems.  Several  type  of  craft  observation 
proolems  exist  today  that  aopear  outside  the 
ability  of  current  sensor  systems.  A  section 
below,  "DSN  Scenarios,"  discusses  several  sucn 
problems  that  can  be  better  handled  by  DSN. 

In  order  to  accrue  these  benefits,  it  is  necessary 
for  the  workers  on  DSN  projects  to  carefully  define 


the  problems  and  work  toward  their  solutions, 
next  section  describes  the  ISI  approach. 


Our  approach  to  analyzing  the  DSN  problem  is  to 
first  develop  a  DSN  framework.  The  framework  is  a 
meta-model  that  defines  the  problem  space.  As 
suob,  it  describes  the  kinds  of  components  and 
phenomena  that  are  to  be  considered.  Included 
besides  craft  and  sensor  equipment  are  the  terrain, 
environment ,  and  extended  components  mentioned 
above.  The  framework  is  then  augmented  by  the 
addition  of  specific  information  to  become  a  model 
for  a  specific  DSN  system.  Naturally,  many  such 
models  can  be  formed.  The  framework  and  models  are 
represented  in  a  notation  being  developed  for  this 
purpose. 

The  notation,  like  other  modelling  languages, 
allows  procedural  descriptions  of  components  and 
their  interactions.  However,  to  allow  the 
framework  to  be  written,  more  is  necessary.  An 
assertion  language  is  also  part  of  the  notation. 
An  assertion  is  a  statement  or  relation,  in 
mathematical  symbolism  (an  extended  first  order 
predicate  calculus),  that  allows  specification 
without  the  details  of  a  pr*^cedure-  For  example, 
an  assertion  in  the  framework  can  stats  that 
associated  with  a  sensor  is  a  radius  of  sensitivity 
and  that  tha  sensor  is  not  affected  by  any  physical 
events  outside  that  range.  In  a  model,  particular 
sensors  have  specifically  stated  radii,  and  a 
consistency  condition  is  that  of  the  assertion  to 
the  procedure  representing  the  sensor.  An  expected 
benefit  of  the  assertion  language  Is  an  enhanced 
ability  to  analyze  properties  or  DSN. 

As  is  well  known,  analysis  Is  not  likely  to 
determine  all  (or  even  many)  properties  of  very 
complex  systems  such  as  DSN.  Therefore,  the 
modelling  notation  has  been  designed  to  be 
executable  by  a  simulation  system.  This  system  Is 
called  tbe  DSN  (software)  testbed.  Simulation 
results  will  be  used  to  augment  and  test  analysis 
results— and  conversely.  Througn  this  approach.  It 
is  hoped  that  Insights  into  possible  DSN 
arcltectures  and  system  tradeoffs  can  be  developed. 


In  order  to  use  the  testbed  simulation  system, 
problems  for  a  DSN  to  solve  must  be  developed. 
These  problems  are  called  scenarios.  a  good  set  of 
scenarios  must  exercise  a  proposed  system  model 
both  in  the  normal  modas  of  operations  and  at  the 
boundaries  of  its  projected  capabilities.  Below, 
several  scenarios  are  briefly  described.  They  were 
selected  not  for  their  completeness  out  because 
they  present  proolems  not  adequately  handled 
without  a  DSN  approach. 

Low-Flvlng  Air— aft.  This  is  a  prool.es  for  two 
reasons:  First,  terrain  features  sucn  as  hills 
obscure  sensors'  views;  thus,  the  set  of 
observations  is  fragmented  in  time.  Second,  -sdars 
have  severe  olutter  problems  wnen  aimed  ’. ow  along 
tne  horizon.  Thus,  many  false  targets  present 
themselves  and  signal  reflections  become 
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troublesome.  To  solve  these  problems,  a  system 
must  os  able  to  combine  disconnected  observations 
having  a  nigh  error  rate.  DSN  should  be  able  to 
outperform  conventional  systems  since  sore 
fragments  of  information  are  available  from  which 
to  construct  a  track.  This  is  particularly  true  if 
multiple  sensors  cover  the  area,  and,  therefore, 
more  readings  from  different  viewpoints  are 
available  over  a  longer  time  span  as  the  craft 
moves  from  one  area  to  another. 


Multiple  Craft.  Tracking.  This  is  a  problem 
currently  receiving  much  attention.  When  more  than 
one  craft  is  in  a  single  sensor's  field  of  view,  it 


may  be  difficult  to  distinguish  them.  Some 
tracking  algorithms  also  run  into  serious 
computation  time  limitations,  A  DSN  should  be  able 


to  use  information  from  many  sensors  over  time  to 


setter  construct  a  realistic  world  picture. 


wont i a  Sensors.  There  are  several  manifestations 
of  this  problem,  e.g.,  observers  with  portable 
terminals  and  intelligence-gathering  craft  (ours). 
In  each  case,  either  the  sensor's  current  location 
must  be  accurately  determined  or  its  output  must 
somehow  be  used  given  the  inherent  inaccuracy  of 
its  measurement  base.  The  system  may  also  want  to 
exercise  control  over  the  mobile  sensor,  requesting 
it  to  move  to  a  critical  region.  Thus, 
intelligence  must  be  used  both  in  interpreting  the 
sensor  output  and  in  utilizing  the  sensor  to 
greatest  advantage. 


Complete  System-  In  this  set  of  scenarios, 
higher-level  components  such  as  C3  systems  appear. 
This  component  has  access  to  intelligence  outside 
the  sensor  sources.  The  problem  Is  to  use  the  DSN 
to  confirm  or  deny  hypotheses  generated  from  the 
intelligence  and  to  use  the  Intelligence  to 
generate  a  more  accurate  world  picture. 

Sate  of  Change.  The  time  it  takes  to  spot  and 
identify  craft  depends  upon  the  type  of  craft  and 
the  type  of  sensor  involved,  for  example,  a  radar 
should  detect  an  aircraft  flying  at  normal 
altitudes  within  seconds.  On  the  other  hand, 
underseas  events  unfold  more  slowly,  and  the 
environment  is  mors  noisy.  Thus,  it  may  take 


minutes,  hours,  or  even  days  to  accurately  locate  a 
submarine  trying  to  avoid  detection  by  operating  at 
low  speeds.  Fast  detection  situations  are 
relatively  simple;  it  Is  known  Immediately  whether 
the  data  is  important  or  not.  In  the  other  case, 
data  that  is  only  potentially  useful  must  be  stored 
for  long  periods  of  time,  For  a  DSN,  this  poses 
the  issues  of  wnere  to  store  the  data  and 
establishing  responsibility  for  monitoring  events 
over  long  time  periods.  It  should  be  noted  that 
the  problems  of  long-term  and  short-term  detection 
cannot  be  completely  decoupled.  For  inscance,  an 
aircraft  flying  near  a  body  of  water  may  create 
phenomena  that  are  observed  by  an  underseas  sensor. 
If  the  presence  of  the  aircraft  Is  not  correlated 
with  the  underseas  data,  large  amounts  of 
information  could  be  stored  away  oy  a  conservative 
wait-and-see  policy. 

Th»  VnU  Case.  No  set  of  scenarios  for  signal 
detection  systems  is  complete  without  inclusion  of 
the  null  case.  For  our  purposes,  this  means  no 
craft  activity  in  tne  area  of  the  DSN’s 
sensitivity.  The  results  of  such  a  simulation  help 
to  characterize  the  profile  of  likely  Type  II 
errors—errors  of  commission. 
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Many  important  possible  advantages  of  DSN  systems 
are  discussed  above.  However,  many  problems 
remain.  These  are  described  by  X.  Temlni,  "DSN: 
An  Attempt  to  Define  the  Issues,"  in  these 
proceedings. 


To  achieve  the  hoped-for  benefits,  it  appears  that 
DSN  systems  must  be  decentralized  in  addition  to 
being  distributed.  Methodology  for  organizing  and 
implementing  sucn  systems  is  not  of  the 
of f-the-shel f  variety.  Much  work  remains  to  be 
done.  In  particular,  methods  of  communicating  for 
decentralized  systems  that  are  compatible  with  the 
desired  flexibility  and  robustness  must  be 
developed.  At  ISI,  we  are  engaged  in  producing  a 
testbed  so  that  Ideas  and  strategies  for  DSN 
architectures  can  be  investigated  and  competing 
concepts  compared. 
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STRAWMAN  DESIGN  OF  A  DSN  TO  DETECT 
AND  TRACK  LOW  FLTING  AIRCRAFT 


By 

R.  Lacoss  and  R.  Walton 
Lincoln  Laboratory 

Massacnusetts  Inatitute  of  Technology 
Cambridge,  Maaa.  02142 


i.  aiaflcaaisa- 

A  preliminary  high  level  deaign  for  a  D8N 
capable  of  detection  and  tracking  of  low  flying 
aircraft  and  cruise  missiles  la  being  prepared. 
Progreaa  in  thia  activity  la  reported  in  this 
paper.  The  deaign  incorporatea  small  radar  and 
acoustic  array  aenaora  and  makes  use  of  packet 
radio  as  the  basic  communication  mechanism.  The 
deaign  is  for  a  system  which  might  be  developed 
and  demonstrated  by  the  aid  1980s.,  However, 
there  is  at  this  time  no  plan  to  actually 
develop  the  system.  The  idea  is  that  an  at¬ 
tempted  design  of  a  realistic  deployable  system 
is  one  of  the  beat  ways  to  identify  basic  DSN 
problems  and  to  evolve  solutions.  We  expeat 
that  in  the  future  the  design  process  will  be 
iterated  several  times  with  major  system  changes 
resulting  from  lessons  learned  during  earlier 
design  phases.  Repeated  designs  and  system 
evaluation  without  commitment  to  or  actual 
development  of  a  specific  sytem  should  be  a  dost 
effective  way  to  develop  DSN  technology  and  to 
eventually  develop  a  specific  DSN  which  is  far 
more  optimal  than  could  be  achieved  by  immediate 
commitment  to  a  specific  system  development  pro¬ 
gram.  The  design  is  a  true  strawman.  It  is  ex¬ 
pected  to  evolve  into  a  design  for  a  system 
which  could  be  developed  but  it  is  expected  that 
the  required  changes  may  be  very  major  indeed. 

The  design  summarized  in  the  other  sections 
of  this  paper  constitutes  our  current  version  1A 
DSN  design.  In  the  coming  year  the  version  1A 
design  will  be  the  stepping  off  point  for  system 
analysis  and  for  software  testbed  simulations 
which  will  be  used  to  further  develop  the  design 
and  demonstrate  how  the  system  represented  ay 
the  design  might  perform.  The  simulated  systems 
will  be  denoted  versions  IS,  1C,  etc.  The  plan 
is  that  they  should  all  be  outgrowths  of  the  1A 
design  but  may  differ  substantially  from  it. 

Future  plans  also  call  for  the  development 
and  demonstration  of  an  experimental  DSN  con¬ 
sisting  of  three  or  more  nodes  with  real  sen¬ 
sors.  Such  an  experimental  system,  at  least 
partially  because  of  the  limited  number  of 
nodes,  will  be  somewhat  different  from  the  large 
scale  systems  represented  by  the  version  1  sys¬ 
tems.  The  design  discussed  here  is  not  for  an 
experimental  system  but  for  a  hypothetical  full 
scale  system. 


XX.  VERSION  14.  DESIGN  FEATURES. 

The  following  are  major  features  to  be  ex¬ 
hibited  by  the  version  1A  design  or  follow-on 
designs  to  be  demonstrated  in  a  software 
testbed: 

>The  System  la  to  Detect,  and  Track  Low  FIvi  n» 
Aircraft. 

The  primary  task  of  the  DSN  is  to  detect, 
locate,  track  and  identify  low  flying  aircraft. 
It  is  to  make  such  surveillance  data  selectively 
available  to  users  who  may  be  within  the  DSN  or 
outside  of  it.  Subject  to  physical  constraints 
dictated  by  sensor  capabilities  the  surveillance 
data  is  to  be  made  available  in. real  time  with 
only  small  delays  for  processing  and  communica¬ 
tion. 

>Use  of  Multiple  Cooperative  Sensors  of  l.lml  ted 
Individual  Capability. 

The  system  will  make  use  of  sensors  which 
are  individually  not  capable  of  furnishing  the 
required  surveillance  information.  At  the 
present  time  attention  is  focused  upon  short 
range  radars  with  very  limited  azimuth  measure¬ 
ment  capability  and  small  acoustic  arrays  which 
can  measure  the  azimuth  of  sound  sources.  Data 
from  multiple  sites  will  generally  be  required 
to  locate  and  track  a  target  and,  as  a  target 
moves  through  the  DSN,  the  set  of  sensors  re¬ 
porting  new  measurements  will  continuously 
change. 

distributed  Intfllllggncc  ii  Uafifl.  £ac  lata 

Screening. 

The  trwendoua  amount  of  raw  data  generated 
by  large  numbers  of  sensors  requires  that  com¬ 
puter  data  screening  and  analysis  replace  manual 
inspection,  and  that  this  be  done  near  the  sen¬ 
sor  because  limited  communications  bandwidth  al¬ 
lows  only  near  final  results  to  be  communicated. 

>Adantion  ia  &SH  Configuration  jnd  IflflA 

Variation- 

One  potential  advantage  of  a  system  which 
makes  use  of  very  large  numbers  of  distributed 
sensors  and  which  also  distributes  the  surveil¬ 
lance  function  is  the  ability  to  adapt  to  and 
continue  effective  operation  in  the  face  of 
failure  of  various  nodes  in  the  system.  The 
node  failures  asy  be  natural,  due  to  damage 
resulting  from  hostile  action,  or  communication 
failures  resulting  from  jamming.  Adaption  to 
changing  situations  is  a  generic  feature  of  DSNs 
and  must  be  included  in  version  1  designs. 

>Packet  Radio  la  Used  £ar  Digital  Communication. 

Packet  radio  is  a  suitable  communication 
system  meeting  the  requirements  of  raoid  deploy¬ 
ability,  aoderately  nigh  rate  of  secure  digital 
data  transfer,  and  unlimited  system  size.  It  is 
also  one  of  several  important  research  efforts 


a 


'A '.A'V. A v; .» 


in  digital  communication,  and  needs  evaluation 
m  a  variety  of  different  application  systems. 

>Verv  Large  Amounts  of  Computer  Power  are 

ABPilad  aa  Haceaaafi- 

Large  gains  In  the  utility  of  sensors  can 
be  made  using  sophisticated  computer  processing 
if  the  raw  data,  and  large  amounts  of  computer 
processing  power  can  now  the  made  cheaply  avail¬ 
able  using  recently  developed  integrated  circuit 
technologies.  As  a  result,  a  large  amount  of 
computing  power  can  be  located  with  each  sensor, 
with  potential  for  improved  sensor  performance 
and  good  results  from  cheaper  sensors. 


HI.  SYSTEM  CQNEIGIffiATICN  OPTIONS. 

There  are  three  different  general  confi¬ 
gurations  for  DSNs.  These  are  area,  linear,  and 
honeycomb.  The  distinction  between  all  of  these 
are  indicated  in  Figure  1 .  Figure  2  shows  the 
area  configuration  in  more  detail.  In  the  fig¬ 
ure  the  basic  spacing  of  10  Km  Is  the  nominal 
spacing  for  the  1A  system.  In  the  1A  system 
eacn  node  has  a  packet  radio,  substantial  pro¬ 
cessing  capability,  an  acoustic  array,  and/or  a 
small  radar.  The  basic  10  km  spacing  has  been 
selected  to  be  consistent  with  the  communication 
system  capabilities,  acoustic  sensor  capabili¬ 
ties,  and  terrain  masking  difficulties  for  low 
flying  aircraft.  Also  shown  in  the  figure  are 
nodes  on  a  50  km  spacing  grid  which  are  denoted 
as  sector  nodes-  These  are  surveillance  data 
collection  points  and  are  discussed  at  greater 
length  elsewhere  in  this  paper. 

The  10  km  distance,  like  other  parameters 
appearing  in  the  version  1A  design,  is  neither 
■unchangeable,  nor  is  it  readily  changed.  Chang¬ 
ing  the  value  can  have  substantial  global  side 
effects.  He  have  selected  a  single  value  and 
just  note  possible  difficulties  which  might  be 
encountered  in  increasing  or  decreasing  the 
value . 

Decreasing  the  10  km  value  results  in  DSN 
systems  which  may  be  less  cost  effective.  Much 
of  the  node  cost  is  electronic,  and  subject  to 
tremendous  decreases  in  price  over  the  next  10 
years.  VLSI  chips  may  someday  result  in  the 
ability  to  furnish  node  processing  for  only  a 
few  thousad  dollars.  In  this  case  ouch  smaller 
spacing  options  may  become  sore  interesting. 
This  might  involve  an  acoustic  only  system  or 
the  use  of  extremely  low  power  cheap  omnidirec¬ 
tional  radars.  Decreasing  the  10  'km  value  would 
then  increase  problems  with  radars  interfering 
with  each  other.  This  should  not  be  a  problem 
for  the  strawman  system  since  the  radars  do  have 
3ome  limited  amount  of  directionality.  Increas¬ 
ing  the  10  km  value  get3  beyond  the  useful  range 
of  acoustic  sensors,  and  introduces  masking 
problems  with  radar. 

The  version  1A  design  effort  has  concen¬ 
trated  upon  the  area  coverage  geometry.  ?ri- 
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marily  this  is  because  the  uniform  distribution 
of  nodes  will  tend  to  generate  a  greater  commun¬ 
ication  load  than  other  configurations.  Thus  if 
the  comunication  issues  for  the  area  geometry 
can  be  handled  the  others  should  cause  no  diffi¬ 
culty.  The  design  could  be  adapted  to  either 
the  linear  or  honeycomb  stuation.  In  the  case 
of  the  linear  configuration  the  overall  system 
would  tend  to  be  much  simpler.  For  the  honey¬ 
comb  the  system  might  be  slightly  simpler  but 
the  problem  of  data  association  and  tracking 
across  sensor  free  areas  would  be  sore  diffi¬ 
cult.  With  respect  to  this  last  comment  note 
that  in  the  long  run  an  area  system  should  func¬ 
tion  when  large  holes  are  torn  in  the  net.  Thus 
the  area  system  ultimately  must  treat  this  prob¬ 
lem  of  association  and  tracking  across  sensor 
free  areas. 


UL-  OPERATIONAL  MODES. 

Our  strawman  DSN  has  both  small  acoustic 
arrays  and  small  radars  at  each  node.  This  pro¬ 
vides  for  operation  in  a  number  of  possible 
modes.  Following  is  a  brief  description  and 
discussion  of  three  such  modes.  The  discussion 
is  in  terms  of  modes  of  a  single  system  but  it 
should  be  clear  that  they  could  also  correspond 
to  systems  using  different  mixes  of  sensors  at 
the  nodes. 

Hcauatlfi  flalz  Surycill'anca  And.  Iras  tana  - 

Detection  and  tracking  can  be  accomplished 
using  only  measurements  made  by  small  acoustic 
arrays  located  at  each  system  node.  Individual 
arrays  are  used  to  get  a  series  of  single  node 
detections  and  estimates  of  possible  target  az¬ 
imuths.  Such  data  is  combined  from  several  such 
nodes  both  to  perform  surveillance  for  new  tar¬ 
gets  and  to  track  targets. 

In  such  an  all  acoustic  mode  the  system  is 
passive  except  for  the  communications.  It  would 
use  less  power  than  with  radars  in  operation  and 
the  elements  of  the  DSN  would  be  less  detectable 
than  with  the  radars  in  operation.  The  lower 
detectability  follows  because  radiated  power 
needed  to  communicate  over  10  km  distances  is 
very  much  less  than  the  radar  power  needed  to 
detect  targets  at  10  km  ranges.  However  in  an 
all  acoustic  mode  the  duality  of  present  loca¬ 
tion  estimates  and  predictions  of  future  posi¬ 
tions  would  probably  not  be  as  high  as  would  be 
possible  when  making  use  of  radar.  This  is  as 
least  partially  due  to  the  inherent  acoustic 
propagation  time  delay  to  the  sensors. 

>Radar  ini y.  Surveillance  AOl  I.-jgjii.-.g ■ 

Each  DSN  node  has  a  small  r3dar  whicn  can 
give  good  range  information  but  very  limited  az¬ 
imuth  and  no  elevation  information.  As  noted 
above  if  the  system  ooerates  using  only  these 
radar  sensors  it  would  consume  more  power,  ce 
more  detectable,  but  probably  be  caoable  of  more 
precise  estimates  of  present  and  future  posi- 


tions.  The  improved  locations  would  follow  from 
multisite  combining  of  range  measurements  and 
not  from  the  use  of  radar  azimuth  measurements. 
Since  the  radars  furnish  Information  for  both 
surveillance  and  tracking  the  most  obvious  mode 
for  individual  radars  is  to  continuously  scan 
and  search  for  targets.  The  system  can  perform 
tracking  by  combining  reports  from  the  individu¬ 
al  radars  which  are  physically  functioning  in 
the  search  mode. 

A  hybrid  mode  would  be  for  some  radars  to 
scan  some  of  the  time  and  to  physically  keep 
beams  pointed  at  specific  targets  some  of  the 
time  to  accomplish  better  tracking  of  already 
acquired  targets.  Multiple  sites  would  still  be 
required  for  good  location  but  range  measure¬ 
ments  for  targets  being  tracked  would  not  be 
constrained  to  one  observation  per  radar  scan 
time  in  the  surveillance  mode.  The  utility  of 
this  hybrid  mode  and  the  impact  upon  radar  eom- 
plextlty  and  cost  are  issues  to  be  considered. 

It  should  be  noted  that  the  hybrid  mode  as¬ 
sumes  enough  radar  redundancy  so  that  some  frac¬ 
tion  of  the  radars  can  be  diverted  from  the  sur¬ 
veillance  task  and  redirected  to  improve  track¬ 
ing  information  on  selected  targets.  This  ob¬ 
servation  la  true  Independent  of  whether  radars 
are  mechanically  or  electrically  steered  since 
the  transmitted  radar  beam  will  be  directed  in 
only  a  single  direction  at  any  one  time. 

Our  strawman  system  does  not  contain  a 
directional  acoustic  receiver  which  is  physical¬ 
ly  moved  to  point  at  targets.  Thus  there  is  no 
acoustic  only  hybrid  mode  which  is  entirely 
equivalent  to  the  hybrid  radar  mode  Just  men¬ 
tioned.  The  closest  equivalent  would  be  to 
selectively  process  data  for  different  direc¬ 
tions  and  frequencies  rather  than  to  routinely 
scan  the  entire  space  of  interest.  This  could 
potentially  reduce  processing  loads  but  we  do 
not  currently  know  how  to  do  this  without 
compromising  performance. 

^simultaneous  Asauatlc  lut  failar  Surveillance 
and.  Iracaag- 

Making  use  of  both  acoustic  and  radar  sen¬ 
sors  is  the  option  which  generates  the  richest 
system  alternatives.  Rather  than  try  to  mention 
a  large  number  of  them  we  cite  only  one  simple 
scenario  which  represents  a  prime  alternative  to 
be  investigated. 

To  minimize  system  detectability  and  to 
conserve  power  a  potentially  useful  node  of 
operation  is  to  perform  surveillance  using 
acoustic  sensors  and  to  operate  radars  only  upon 
cue  for  tracking.  A  system  operating  in  this 
way  would  handle  a  target  entering  the  DSN  from 
the  outside  the  coverage  area  as  follows.  Ini¬ 
tial  detection  and  rougn  location  is  obtained 
acoustically  as  the  target  is  entering.  The  ra- 
lars  in  the  immediate  target  area  and  adjacent 
areas  are  triggered  into  operation.  As  the  tar¬ 
get  goes  deeo  into  the  DSN  :he  cueing  of  radars 


will  be  from  extrapolation  of  target  tracks. 
Internal  acoustic  sites  will  furnish  backup  as 
wall  as  continuous  surveillance  and  target  sig¬ 
nature  information.  Our  current  view  is  that 
Individual  radars  will  operate  in  the  scanning 
mode  with  tracking  done  from  the  radar  measure¬ 
ments  obtained.  However  the  more  complicated 
radars  suggested  in  the  context  of  the  all  radar 
hybrid  mode  above  could  also  be  considered. 


1.  MULTISITE  PROCESSING. 

Multiple  site  target  detection  and  tracking 
functions  are  to  be  performed  at  every  DSN  node 
independently  of  other  nodes.  Each  node  will 
use  all  the  information  available  to  it  for  this 
purpose.  The  input  information  will  consist  of 
aanaor  reports  and  tracking  reports  received 
from  neighbor  nodes.  In  general  only  reports 
received  directly  on  a  single  radio  hop  from  a 
neighbor  will  be  required  or  used  for  basic  mul- 
tiaite  processing.  Figure  3  shows  the  12  nodes 
which  we  assume  would  nominally  receive  a  one 
hop  sensor  report  from  a  single  node.  This  as¬ 
sumption  is  discussed  more  in  Section  VII  of 
this  paper. 

The  sensor  reports  generated  by  a  node  sum¬ 
marize  target  information  which  can  be  obtained 
by  analysis  of  the  data  obtained  from  the  sen¬ 
sors  at  that  node  without  reference  to  any  other 
nodes.  For  acoustic  sensors  this  can  be  a  list 
of  the  most  likely  directions  for  targets  and  a 
summary  of  spectral  characteristics  for  sound 
from  each  of  those  directions.  For  radar  this 
can  be  a  list  of  possible  ranges  and  rough  az¬ 
imuths  plus  some  target  doppler  information. 
Tracking  reports  combine  sensor  reports  from 
several  nodes  plus  other  available  tracking  re¬ 
ports  to  give  actual  target  positions  and  tra¬ 
jectory  information  in  a  directly  usable  system 
coordinate  system.  The  tracking  reports  may 
also  summarize  other  target  signature  or  iden¬ 
tification  information. 

There  are  a  number  of  reasons  for  making 
every  node  a  multisite  node.  One  is  the  built 
-n  redundancy  which  this  achieves.  In  effect  a 
target  may  be  under  surveillance  by  and  be  in¬ 
cluded  in  the  tracking  reports  of  several  nodes. 
Loss  of  a  few  nodes,  apart  from  the  fact  that 
sensor  data  from  those  nodes  will  be  no  longer 
available,  will  not  have  a  serious  impact  upon 
system  performance.  Another  factor  is  the  anti¬ 
cipated  computational  load  of  multisite  process¬ 
ing  and  sensor  processing.  In  the  current 
design  the  computational  requirements  for  sensor 
processing  at  each  node  are  very  substantial.  A 
belief  yet  to  be  confirmed  is  that  basic  mul¬ 
tisite  processing  computational  load  is  not  sub¬ 
stantial  relative  to  the  basic  sensor  proceeding 
load.  Thus  the  cost  of  routinely  doing  multiple 
site  processing  with  high  redundancy  will  not 
substantially  add  to  computational  Hardware  in 
the  system.  Another  factor  involves  communica¬ 
tions.  With  each  node  being  a  aultisite  node  we 
are  able  to  design  a  system  in  ’which  ail  sensor 
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reports  are  made  on  a  single  communication  hop. 
3y  excluding  aultlple  hop  routes  for  sensor  re¬ 
ports  we  avoid  an  Increase  in  sensor  communica¬ 
tion  requirements  which  would  occur  for  aultlple 
hop  distribution. 

At  this  time  the  multisite  processing  of 
radar  data  has  not  yet  been  specified  and  only 
the  general  ideas  of  minimal  acoustic  aultlsite 
processing  have  been  considered.  We  include 
here  a  brief  discussion  of  an  approach  to  aul¬ 
tlsite  tracking  for  acoustic  data.  More  specif¬ 
ic  and  detailed  algorithms  and  procedures  will 
be  specified  in  the  process  of  implementing 
simulations  of  a  functional  DSN. 

First  we  consider  the  aultlsite  use  of 
acoustic  data  for  location.  Measureaents  of 
acoustic  azimuth  are  made  at  each  DSN  node  every 
two  seconds.  The  node  locally  tries  to  associ¬ 
ate  successive  observations  so  that  it  can 
deliver  sequences  of  azimuth  measurements  which 
are  associated  with  a  single  target.  The  node 
cannot  locate  the  target  but  it  can  often  fur- 
nisn  this  association  between  sequences  of  ob¬ 
servations.  Thus  the  multisite  process  has 
available  as  input  a  collection  of  lists  of  az¬ 
imuths  from  several  sensors  with  each  list 
corresponding  to  measurements  of  a  single  target 
from  a  single  node.  Some  lists  may  have  weak 
links  and  some  lists  which  should  be  a  single 
list  may  be  broken  into  two  or  more  due  to  dif¬ 
ficulties  in  correctly  associating  sequential 
observations.  The  basic  multisite  task  is  to 
use  these  data  to  locate  and  track  targets. 

As  described  in  more  detail  in  another 
workshop  paper  — "Multisite  Processing  of  Acous¬ 
tic  Azimuth  Measurements  to  Locate  and  Track"  by 
3.  Walton — each  azimuth  versus  time  curve  can  be 
mapped  into  a  possible  position  curve  In  the 
horizontal  plane.  A  surveillance  time  T,  ear¬ 
lier  than  the  present  tine,  is  associated  with  a 
possible  position  curve  and  the  mapping  from  az¬ 
imuth  versus  time  to  possible  positions  depends 
upon  T.  Imagine  that  T  is  fixed  and  that  all  of 
the  possible  position  curves  are  displayed  using 
a  different  color  for  each  contributing  node. 
It  should  be  relatively  easy  for  a  person  to  lo¬ 
cate  targets  using  such  a  display.  The  targets 
are  where  segments  from  different  nodes  inter¬ 
sect.  The  DSN  will  require  a  program  to  sift 
this  display  and  find  targets.  Results  reported 
in  the  workshop  paper  by  Walton  show  good 
results  with  only  2  nodes  and  only  one  target, 
provided  the  target  is  not  in  certain  areas  re¬ 
lative  to  the  2  nodes  where  its  position  becomes 
ambiguous . 

Exactly  how  radar  will  fit  into  the  picture 
depends  on  yet  undecided  details  of  the  radar 
and  how  it  is  to  be  used  in  the  system.  The 
strawman  radar  capability  discussed  elsewhere  in 
this  report  supplies  good  range  information  but 
poor  azimuth  resolution.  Each  such  radar  meas¬ 
urement  then  corresponds  to  a  possible  position 
curve  which  is  a  segment  of  circular  arc.  This 
fits  nicely  into  the  above  picture  if  the  time 


of  the  radar  observation  is  the  time  T.  Such 
radar  possible  position  curves  would  typically 
be  shorter  than  those  generated  by  acoustic  sen¬ 
sors.  This  could  be  used  to  resolve  ambiguities 
and  possible  ghosts  which  could  not  be  complete¬ 
ly  resolved  using  only  the  acoustic  data.  How¬ 
ever  ,  there  may  be  problems  when  the  radar  ob¬ 
servations  do  not  occur  at  time  T.  This  is  a 
problem  to  be  investigated  in  conjunction  with 
continuing  design  and  first  functional  simula¬ 
tions  of  a  DSN.  Dne  possibility  is  that  radar 
data,  as  mentioned  in  our  discussion  of  system 
modes,  will  not  be  used  extensively  for  initial 
target  detection  and  location  but  will  be  used 
only  for  refining  tracks  for  targets  (and  false 
targets)  tentatively  detected  and  located 
acoustically.  This  would  be  the  case  if  indivi¬ 
dual  radars  were  focused  upon  specific  limited 
potential  target  areas  rather  than  being  operat¬ 
ed  in  a  general  surveillance  mode. 

Independent  of  just  how  radars  are  used 
there  is  always  the  problem  of  refining  the 
track  estimate  of  a  target  given  the  ever  in¬ 
creasing  amount  of  data.  There  are  considerable 
possibilities  for  track  improvement  as  more  data 
arrives.  Most  likely  some  form  of  Kalman 
filtering  will  be  used  for  this  purpose. 


21.  REPORTING  IQ.  SECTOR  NODES. 

For  reporting  puroosea,  DSN  geometries  are 
divided  into  reporting  sectors.  Each  sector  has 
at  its  center  a  sector  node  with  which  system 
users  interface  either  directly  or  through  com¬ 
munication  links.  The  reports  from  tracking 
programs  operating  at  every  node  in  the  sector 
must  be  summarized  and  surveillance  information 
for  the  sector  must  flow  to  the  sector  node. 
The  flow  of  reports  from  the  edges  of  the  re¬ 
porting  sector  to  the  sector  node  is  by  means  of 
a  patterned  sequence  of  broadcasts.  During  nor¬ 
mal  operation  the  reports  within  a  sector  are 
collected  with  time  delays  on  the  order  of  a 
second.  Figure  A  shows  the  configuration  of  re¬ 
porting  sectors  for  normal  system  operation. 
Normal  operation  is  here  defined  to  be  operation 
when  all  sector  nodes  are  functional.  The  re¬ 
porting  sectors  are  defined  so  that  the  sector 
associated  with  a  sector  node  constitutes  all  of 
the  area  which  is  closer  to  that  sector  node 
than  any  other  sector.  As  shown  it  happens  that 
in  addition  to  the  sector  node  itself  the  re¬ 
porting  sector  contains  30  sensors  of  which  '2 
are  on  the  boundary. 

Reports  flow  from  the  edges  of  reporting 
sectors  to  the  sector  nodes  as  follows.  Each 
node  produces  and  broadcasts  a  report  roughly 
once  each  second.  This  report  is  the  best  re¬ 
port  summary  that  the  node  can  generate  for  an 
area  around  the  node  which  is  known  as  its  re¬ 
port  area  of  interest.  Figure  u  also  shows  the 
area  of  interest  for  a  typical  node  when  the  DSN 
is  in  full  operational  condition  with  ail  sector 
nodes  functioning.  Each  node's  report  covers  a 
particular  area  and  the  areas  for  different 


nodes  overlap  very  much.  In  prepairing  Its  re¬ 
port  a  node  uses  all  reports  It  has  heard  as 
well  as  its  own  multisite  processing  of  sensor 
reports.  With  reference  to  Figure  b  it  is  clear 
how  the  sector  node  can  generate  its  picture  of 
its  report  sector  from  the  reports  it  obtains 
from  neighbors.  It  should  be  noted  that  report 
coverage  is  very  redundant  so  the  sector  node 
can  get  reports  for  a  particular  part  of  its 
sector  from  many  paths.  This  will  automatically 
compensate  for  malfunctioning  nodes,  for  commun¬ 
ication  errors,  and  for  missing  paths.  Of 
course  the  sector  node  will  also  produce  and 
broadcast  its  sector  report  once  each  second. 

Under  normal  operation  as  described  above 
each  node  in  the  DSN  produces  and  broadcasts  a 
complete  summary  report  once  each  second  for  a 
region  of  the  same  size  and  shape  as  the  report 
sectors  associated  with  the  sector  nodes.  Each 
node's  report  is  limited  in  length  to  about  1500 
data  bits.  This  is  enough  to  accomodate  10 
targets.  If  there  are  up  to  20  targets  the  ex¬ 
tra  targets  will  be  reported  at  the  cost  of 
halving  the  reporting  rate  by  sending  half  the 
report  each  second.  Targets  in  some  region  may 
be  preferred  and  sent  every  second.  Beyond  20 
targets  it  will  be  necessary  for  the  node  com¬ 
puters  to  pick  targets  not  of  interest  to  be 
discarded  from  some  reports.  For  example  slow 
targets  may  be  reported  2  seconds  out  of  every 
3.  The  performance  of  the  system  is  not  optimal 
above  10  targets  and  the  system  behaviour  is  not 
yet  specified  above  20  targets. 

Saturation  of  the  1500  bits  of  a  report  by 
excessing  numbers  of  targets  is  a  problem  which 
can  be  more  serious  when  some  sector  nodes  are 
not  functioning  and  their  report  sectors  must  be 
absorbed  into  other  report  sectors.  Figure  5 
shows  the  redefinition  of  report  sectors  which 
results  when  a  single  sector  node  fails.  Each 
neighbor  has  increased  the  area  of  its  report 
sector  by  one  sixth  and  problems  can  result. 
The  report  area  of  interest  for  nodes  other  than 
the  sector  nodes,  if  unchanged  from  that  shown 
in  Figure  b,  are  not  sufficient  to  accomplish 
the  flow  of  surveillance  information  from  the 
area  near  the  failed  sector  node  to  the  still 
functioning  sector  nodes.  The  shape  of  the  re¬ 
port  area  of  Interest  could  be  maintained  and 
the  size  increased.  However  this  would  mean 
that  the  same  number  of  bits  would  be  available 
to  summarize  a  larger  area.  The  result  would  be 
a  reduction  in  the  target  density  which  could  be 
handled.  The  alternative  actually  selected  is 
to  allow  different  shapes  of  report  area  of  in¬ 
terest  for  different  nodes.  For  example  nodes 
near  a  functioning  sector  node  and  on  the  side 
toward  a  failed  sector  node  will  have  adjusted 
areas  of  interest  to  encompass  areas  further 
away  iowared  the  failed  node  without  increasing 
the  size  of  its  area  of  interest.  The  nature  of 
the  communication  pattern  used  to  cause  reports 
to  flow  to  sector  nodes  is  such  that  reports 
from  the  area  near  the  failed  node  will  reach 
functioning  nodes  with  delays  on  the  order  of 
two  seconds. 


These  issues  would  be  more  serious  if  more 
nearby  sector  nodes  were  to  stop  functioning. 

Til-  COMMUNICATIONS. 

The  DSN  system  will  make  use  of  packet  ra¬ 
dios  for  communication.  However,  the  DSN  appli¬ 
cation  and  traffic  is  quite  different  from  that 
for  which  packet  radio  networks  have  been 
designed.  The  normal  PRN  application  is  for 
large  numbers  of  independent  nodes  each  generat¬ 
ing  bursty  traffic  with  low  average  rates. 
Although  there  are  statistical  delay  specifica¬ 
tions,  occasionally  very  large  delays  may  be  ex¬ 
perienced  and  tolerated.  Errors  and  lost  mes¬ 
sages  are  universally  not  tolerated  and  the  sys¬ 
tem  is  designed  and  organized  to  enforce  this  as 
much  as  possible.  In  general  there  is  no 
quaranteed  performance  for  any  individual  user. 

The  DSN  requirements  differ  substantially. 
The  basic  DSN  is  an  organized  3et  of  nodes  work¬ 
ing  cooperatively.  It  is  most  likely  that  high 
target  density  in  an  area  will  cause  DSN  commun¬ 
ication  traffic  to  be  high  in  that  area.  Satis¬ 
factory  performance  must  be  assured  at  such 
critical  times.  In  particular  some  DSN  func¬ 
tions  will  require  assured  data  rates  and  delays 
even  in  the  worst  conditions.  Fortunately  the 
fact  that  the  DSN  nodes  are  not  independent 
should  allow  us  to  coordinate  their  communica¬ 
tion  and  thus  provide  this  essential  guaranteed 
response.  In  some  cases  lost  messages  or  mes¬ 
sages  received  in  error  (which  can  be  treated  as 
lost  messages)  may  be  somewhat  tolerable. 

In  this  section  we  review  a  very  simple 
packet  radio  modal  being  used  for  preliminary 
DSN  communications  system  design  and  analysis, 
review  the  major  DSN  communication  requirements, 
and  outline  a  strawman  approach  for  using  packet 
radios  to  furnish  the  required  service.  Only 
area  surveillance  with  a  uniform  grid  of  sensor 
nodes  is  discussed.  This  is  the  most  stressing 
of  the  DSN  system  system  geometry  options  from 
the  communication  point  of  view.  We  nave  used  a 
regular  hexagonal  grid  with  the  understanding 
that  it  must  be  possible  to  adapt  to  any  reason¬ 
able  regular  or  irregular  distribution  of  sen¬ 
sors. 

VII . 1  Assumed  Packet  Radio  Characteristics  fqr 
Initial  Design  Purge sea. 

The  Distributed  Sensor  Network  project  r.as 
decided  to  use  packet  radio  for  communications. 
In  general  the  characteristics  of  a  Upgraded 
Packet  Radio  are  assumed  if  they  differ  from  the 
current  packet  radio. 

Our  model  cf  the  packet  radio  Is  very  sim¬ 
ple.  We  assume  that  in  any  given  10  milli¬ 
second  period  a  packet  with  up  to  i 500  lata  tits 
can  be  transmitted.  The  actual  number  of  bits 
in  the  packet  may  be  as  large  as  20CC  with  the 
extra  500  being  error  control  and  other  informa¬ 
tion  not  considered  soecifically  to  be  lata.  We 
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assume  that  the  error  control  allows  the  re¬ 
ceiver  to  detect  and  discard  all  packets  re¬ 
ceived  in  error.  Nominally,  this  packet  might 
Se  heard  by  any  neighboring  node  10  to  20  km 
away,  with  error  rates  discussed  below. 

We  assume  that  the  time  a  packet  is 
transmitted  is  under  the  control  of  the  node's 
DSN  computer.  We  assume  that  a  node  cannot  re¬ 
ceive  more  than  one  packet  at  a  time.  We  also 
assume  that  if  two  nearby  neighbors  transmit 
overlapping  packets,  then  the  two  packets  will 
collide  and  neither  will  be  received  correctly. 
(We  do  not  use  the  potential  of  Upgraded  Packet 
Radio  for  receiving  the  first  packet  and  ignor¬ 
ing  the  second.) 

tor  the  purpose  of  preliminary  system 
analysis  required  to  confirm  that  the  DSN  design 
might  work  we  have  made- simple  assumptions  about 
communication  network  connectivity  and  probabil¬ 
ity  of  correct  reception  of  a  packet  over  a  sin¬ 
gle  hop.  We  assume  that  70*  of  all  single  hop 
transmission  paths  of  lengh  10  km  will  be  use- 
able.  The  rest  will  not  be  useable  due  to  sit¬ 
ing,  transmission  path  effects,  and  masking. 
For  paths  of  length  17  km  (second  nearest  neigh¬ 
bors)  the  probability  of  useability  is  reduced 
to  50%.  Longer  paths  are  assumed  to  not  be  use¬ 
able  at  all.  Finally,  the  single  hop  error  free 
transmission  over  a  useable  path  is  assumed  to 
be  70*.  Such  figure  should  be  achievable  in 
practice. 

V IT .2  General  Com-unicatlon  Requirements . 

The  DSN  communication  system  must  furnish 
several  kinds  of  service.  These  are:  (l)-Sensor 
reports  service,  (2)-Surveillance  reports  and 
command  distribution  service,  and  (3)-Special 
point  to  point  service,  each  of  these  is  dis¬ 
cussed  briefly  below. 

'-•Sensor  rs.par.Da  service. 

Each  sensor  node  will  deliver  a  sensor  re¬ 
port  once  each  second  to  all  of  the  tracking 
sites  which  are  within  one  communication  hop. 
Such  reports  are  a  summary  of  the  results  of 
processing  recent  sensor  data.  In  general  a 
node  may  take  more  that  a  second  to  update  its 
local  world  view  based  only  on  its  own  sensors. 
For  example  the  current  design  calls  for  this  to 
be  done  one  every  two  seconds  for  acoustic  data 
and  once  every  four  seconds  for  radar.  Thus 
with  this  situation  more  than  one  sensor  report 
will  be  available  to  distribute  each  new  world 
view  generated  from  the  sensors  at  the  node. 

A  sensor  report  will  contain  up  to  2000 
cits.  Of  these  1500  may  Se  data  and  the  rest 
communication  heaaers,  error  detection,  time  ar.b 
site  information,  and  other  overhead,  liven  a 
message  of  this  sire  each  cracking  code  will  re¬ 
ceive  ' 6x0 . ce6x0.7)x2 . 0*:u . u  kilobits  and 
transmit  2  xiloolts  every  second.  This  follows 
from  the  geometry  and  the  crude  lir.k  isaoility 
model  discussed  above. 


There  may  be  circumstances  in  which  a  sin¬ 
gle  hop  link  from  a  sensor  to  a  tracking  noae 
which  should  use  it  will  not  exist.  Tracking 
computers  will  be  allowed  to  arrange  to  obtain 
data  from  selected  multi  hop  paths  but  only  in 
3o  far  a3  the  system  has  capacity  to  supply  this 
data  without  harming  the  operation  of  the 
overall  system.  This  will  be  done  using  the 
special  point  to  point  service  discussed  below. 
It  is  generally  discouraged.  The  communication 
capacity  for  such  multi  hop  service  i3  part  of 
the  capacity  required  for  special  service  and  is 
not  considered  to  be  part  of  the  basic  sensor 
report  service.  The  basic  sensor  report  mechan¬ 
ism  is  to  be  one  hop. 

The  detailed  use  of  the  2000  bit3  in  a  sen¬ 
sor  report  is  not  yet  specified.  However  it  is 
not  likely  that  the  number  of  bits  (and  thus  the 
traffic)  can  be  substantially  reduced.  Although 
the  typical  number  of  real  targets  in  range  of  a 
sensor  will  be  small— perhaps  one  or  even 
less — the  system  must  be  capable  of  handling 
several  targets  and,  as  a  direct  result  of  try¬ 
ing  to  report  targets  with  poor  signal  to  noise 
ratios,  will  normally  deliver  several  fal3e 
alarms  each  reporting  period.  Thus  if  only  10 
possible  targets  are  reported  (sum  of  actual 
target  measurements  and  of  possible  ones  which 
in  fact  cannot  be  verified  by  multiple  sites)  we 
find  that  only  150  bits  are  allocated  to  each 
possible  target.  In  general  we  can  think  of  the 
sensor  report  as  a  summary  of  important  charac¬ 
teristics  of  frequency-wavenumber  or  range- 
azimuth-doppler  power  distributions  rather  than 
target  reports  as  such.  From  thi3  viewpoint  we 
could  use  some  "50  bits  to  describe  each  of  13 
interesting  looking  peaks  in  those  maps. 

>£££&££.  SuczaUiaasa  and  Command  Distribution 
Service. 

The  DSN  trial  design  includes  surveillance 
computers  located  every  50  km  on  a  uniform  grid. 
Each  of  these  collects  a  summary  of  target  in¬ 
formation  over  the  wnole  area  within  about  25  km 
of  where  it  is  located.  It  does  this  by  col¬ 
lecting  surveillance  Information  from  tracking 
computers  within  its  surveillance  sector. 
Tracking  information  is  passed  inward  towards 
sector  nodes  as  described  in  Section  VI. 

Somewhat  arbitrarily  we  will  require  that 
the  DSN  be  able  to  report  once  each  second  on  at 
least  10  targets  in  the  normal  sector  surveil¬ 
lance  area  without  regard  to  where  in  the  area 
they  are  located.  If  more  targets  are  present 
It  may  sometime  be  possible  to  get  one  second 
reports  but  they  will  not  be  assured.  In  general 
for  more  targets  in  the  area  the  sector  surveil¬ 
lance  computer  can  either  be  selective  of  the 
targets  it  tracks  or  use  more  than  cne  second  to 
complete  3  single  surveillance  survey  of  the 
sector.  If  sector  nodes  are  not  functioning  so 
that  sector  surveillance  areas  are  redefined  mo 
expanded  larger  belays  are  allowed.  Specifical¬ 
ly,  acoroximat ely  one  secono  ielav  is  allcweo 
for  eacr.  normal  surveillance  sector  wrier,  a 
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port  must  traverse. 

Some  1500  bits  should  be  sufficient  to 
represent  the  surveillance  information  for  10 
targets  in  the  area.  If  so  a  node  broadcasting 
its  normal  area  of  interest  surveillance  report 
as  discussed  in  Section  VI  will  broadcast  1500 
data  bits  with  500  bits  of  error  correction  and 
other  packet  header  information.  The  concept 
that  each  node  will  broadcast  a  surveillance  re¬ 
port  each  second  results  in  14.4  kilobits  incom- 
tning  surveillance  reports  and  2.0  outgoing  at 
each  node  as  was  the  case  for  sensor  reports. 

In  addition  to  reports  flowing  into  sector 
nodes  there  will  be  a  flow  of  commands  from  the 
sector  nodes  to  the  nodes  within  the  surveil¬ 
lance  sector.  This  will  be  low  data  rate  com¬ 
pared  to  the  reports  themselves. 

>Scegial  Point  to  Point  Service. 

Some  users  will  need  assured  real  time 
point  to  point  service  in  the  DSN.  For  example, 
above  we  mentioned  alternate  routing  when  the 
standard  one  hop  3ensor  report  distribution  ser¬ 
vice  is  not  adequate  for  a  particular  tracking 
computer.  Sucn  users  should  be  guaranteed  a 
communication  rate  and  delay.  The  circuit  will 
have  assigned  a  probability  that  any  particular 
message  will  actually  be  transfered  from  end  to 
end.  That  probability  will  usually  grow  linear¬ 
ly  with  the  number  of  hops  in  the  overall  cir¬ 
cuit.  At  any  node  along  the  route  of  the  cir¬ 
cuit  the  user  must  be  able  to  extract  and  add 
data  to  messages  as  long  as  the  outgoing  link 
remains  within  the  rated  capacity  of  the  cir¬ 
cuit.  There  will  be  needs  for  assured  service 
with  various  capacity.  For  example  the  alterna¬ 
tive  routing  of  sensor  reports  to  tracking  com¬ 
puters  would  require  2kb  circuits.  Here  we  only 
discuss  the  maximum  assured  capacity  which  the 
DSN  must  offer  for  users. 

A  point  to  point  user  could  be  located  at  a 
sector  surveillance  site  and  need  to  examine  in 
detail  a  great  deal  of  the  data  at  a  single 
tracking  site  in  the  sector.  In  this  case  the 
rate  to  the  tracking  site  is  small-- Limited  to  a 
few  commanda— but  the  rate  into  the  sector  sur¬ 
veillance  site  could  be  large.  The  tracking 
node  of  interest  could  change  as  often  as  every 
30  seconds  to  keep  up  with  a  target  flying  close 
to  the  speed  of  sound.  On  occasion  such  data 
may  al30  be  passed  on  to  a  user  many  surveil¬ 
lance  sectors  away. 

The  high  rate  data  of  Interest  might  be 
sensor  reports,  sections  of  frequency-wavenumber 
power  maps,  range-azimuth-doppler  naps,  or  even 
raw  data.  Acoustic  data  is  one  likely  kind  of 
raw  data.  It  should  be  possible  to  at  least  ob- 
tain  raw  data  from  a  single  channel  of  acoustic 
data.  3uch  data  is  gathered  at  a  2  kilohertz 
rate  with  16  bit  samples.  Assuming  a  very  nomi¬ 
nal  factor  of  two  data  compression  this  will 
give  a  '6  Kilobit  per  second  data  rate.  Other 
data  of  interest  might  be  all  the  sensor  data 


reports  being  used  by  a  particular  tracking  com¬ 
puter.  This  would  amount  to  at  least  3. 2x1. 5 
kilobits  per  second  and  probably  somewhat  more. 
Another  might  be  a  radar  clutter  map  of 
100*36=3600  power  values.  Based  on  such  con¬ 
siderations  we  require  the  DSN  to  be  able  to 
deliver  point  to  point  service  at  rates  up  to  17 
kilobits  per  second. 

If  16  khz  circuits  are  to  be  supported  then 
any  node  must  be  able  to  deal  with  32khz  total 
circuit  traffic  at  least.  This  is  the  sum  of 
the  incoming  and  outgoing  traffic.  It  can  be 
split  arbitrarily  between  input  and  output  in 
general  and  each  stream  can  in  fact  be  any 
number  of  smaller  capacity  circuits.  Each  of 
the  individual  circuits  will  have  its  own  error 
rates  and  quarateed  capacity  and  delays. 

Delay  characteristics  for  point  to  point 
transmission  should  be  similar  to  those  for  sur¬ 
veillance  reporting.  That  is,  the  delays  on  the 
order  of  one  second  per'  surveillance  sector 
traversed  are  to  be  allowed. 

In  general  a  request  for  circuit  service 
must  be  made,  the  system  will  decide  if  it  can 
be  supported,  and  it  will  be  granted  or  denied. 
Priority  requestes  may  result  in  the  ungranting 
of  currently  quaranteed  service. 

2I2..1  Strawman  Communication  Organization. 

The  communication  needs  for  DSN  are  consid¬ 
erable.  As  outlined  in  the  requirements  section 
above  the  message  traffic  (sum  of  incoming  and 
outgoing)  at  a  single  node  can  easily  be  50  ki¬ 
lobits  per  second  or  more  and  the  delays  must 
not  be  excessive.  To  achieve  this  kind  of 
throughput  using  packet  radio  will  take  special 
effort.  Suppose  the  basic  packet  radio  rate  is 
200  kbs.  Random  access  or  carrier  sense  access 
to  such  a  channel  could  seriously  constrain  DSN 
performance.  Their  are  various  options  to  help 
the  situation.  One  is  a  system  very  much  like 
packet  radio  but  with  rates  increased  by  almost 
an  order  of  magnitude  by  increasing  bandwidth. 
A  second  is  to  use  a  layered  packet  radio  net¬ 
work  to  effectively  increase  capacity.  This 
would  involve  running  several  packet  radio  net¬ 
works  simultaneously  ia  the  same  frequency  band 
but  with  different  codes.  Each  radio  would  have 
multiple  decoders  and  multiple  encoders  of  the 
signal.  A  third  option  is  to  recognize  the 
structured  nature  of  the  DSN  system  and  to  make 
use  of  this  to  coordinate  node  transmissions  so 
that  the  available  channel  is  very  heavily  used 
and  self- interference  is  minimized.  This  last 
is  the  option  which  is  further  developed  in  this 
section  to  indicate  how  a  packet  radio  with  a 
200  kbs  rate  might  furnish  all  DSN  communica¬ 
tions.  Minor  changes  such  as  having  a  broadcast 
mode  as  well  as  a  point  to  point  mode  and  allow¬ 
ing  packets  which  do  not  get  retransmitted  if 
not  acknowledged  will  be  required. 

>Qverview  2l  Version  H. 


Version  1A  DSN  communications  is  modular¬ 
ized  by  splitting  all  communications  into 
separate  "circuits".  A  circuit  is  a  particular 
conceptual  and  algorithmic  construct  for  commun¬ 
ications.  A  circuit  is  a  time  ordered  geometri¬ 
cal  pattern  of  packet  transmissions.  Roughly  it 
i3  a  statement  that  specific  nodes  will  transmit 
packets  in  some  specific  order  during  a  given 
time.  The  algorithmic  details  of  chosing  which 
nodes  transmit  when  are  flexible.  8ut  there 
must  be  an  explicit  algorithm  for  each  type  of 
circuit. 

It  is  necessary  to  assign  specific  time  in¬ 
tervals  to  each  circuit.  Some  circuits  have  the 
potential  for  interfering  substantially  with 
other  circuits,  and  must  be  separated  in  time. 
Other  circuits  may  not  seriously  interfer  with 
each  other,  and  need  not  be  separated  in  time. 
It  takes  10  milliseconds  to  transmit  a  maximum 
length  2000  bit  packet.  Therefore  we  will 
divide  each  second  into  100  intervals  of  10  mil¬ 
liseconds  each,  and  assign  some  number  of  these 
10  milll-second  intervals  to  each  circuit. 

Version  1A  defines  three  kinds  of  circuits: 
the  sensor  circuit,  the  report  circuit,  and  spe¬ 
cial  (query)  circuits.  These  correspond  to  the 
three  kinds  of  services  discussed  in  the  re¬ 
quirements  section  above.  Of  the  100  10  mil¬ 
lisecond  time  intervals  in  each  second,  25  are 
allocated  to  the  sensor  circuit,  25  to  the  re¬ 
port  circuit,  and  50  to  the  various  special  cir¬ 
cuits. 

The  sensor  circuit  moves  messages  from  each 
node  to  all  its  neighbors.  There  are  no  ack¬ 
nowledgements,  and  any  data  that  is  to  arrive 
with  high  probablility  must  be  retransmitted  re¬ 
peatedly. 

The  report  circuit  moves  messages  from  each 
node  to  sector  nodes,  which  in  turn  are  expected 
to  move  the  messages  to  a  manned  computer 
center.  The  report  circuit  also  moves  occas¬ 
sional  control  messages  in  the  reverse  direc¬ 
tion,  from  the  sector  nodes  to  all  other  nodes. 
Acknowlegment  is  optional  for  each  message.  Be¬ 
cause  of  the  many  paths  available  to  or  from 
sector  nodes,  an  unacknowleged  message  has  quite 
a  high  probability  of  arriving  at  its  destina¬ 
tion,  though  this  probability  goes  down  when  re¬ 
port  circuit  traffic  becomes  very  heavy  and  the 
number  of  paths  is  reduced  to  accommodate  the 
load.  Report  circuit  messages  are  broadcast  so 
they  may  be  heard  by  all  surrounding  nodes,  and 
thereby  are  also  the  means  by  which  each  node 
knows  what  its  neighbors  are  tracking. 

Special  circuits  are  established  on  request 
between  any  two  nodes  for  lata  rates  up  to  17 
kilobits  ter  second  in  one  direction,  and  a  low 
data  rate  in  the  reverse  direction.  Usually 
these  circuits  are  used  for  querying  a  node,  and 
transmit  data  principally  from  that  node  to  the 
nearest  sector  node.  Acknowlegment  ar.d  re¬ 
transmissions  are  automatic  for  special  cir¬ 
cuits  ,  except  that  messages  can  be  marked  to  be 


thrown  away  if  the  backlog  in  the  special  cir¬ 
cuit  exceeds  one  of  several  limits  (which  in 
normal  situations  happens  very  rarely) . 

Higher  level  system  functions  will  general¬ 
ly  use  both  report  and  special  circuits. 

CCommunlcatlon  Patterns. 

Simultaneous  broadcasts  by  packet  radios 
separated  by  at  least  50  kilometers  cannot  in¬ 
terfer  with  each  other  for  any  DSN  purpose. 
Therefore  we  want  to  organize  communication  so 
that  only  nodes  separated  by  at  least  50  kilome¬ 
ters  will  broadcast  simultaneously.  This  is 
done  by  breaking  the  DSN  into  communication 
cells  with  50  km  sides  and  arranging  for  no  more 
than  one  node  in  a  cell  to  broadcast  at  any 
given  time.  We  illustrate  the  idea  here  in 
terms  of  the  reports  circuit. 

The  report  circuit  performs  three  func¬ 
tions:  it  conveys  report  messages  from  arbitrary 
nodes  to  sector  nodes,  from  whence  they  are  sent 
to  manned  control  centers;  it  broadcasts  track¬ 
ing  results  from  each  node  to  all  neighboring 
nodes,  as  an  aid  to  the  tracking  programs  in 
neighboring  nodes;  and  it  broadcasts  control 
massages  from  each  sector  node  to  all  surround¬ 
ing  nodes. 

The  report  circuit  is  a  circuit  with  a  typ¬ 
ical  broadcast  pattern  as  shown  in  Figure  5. 
The  nodes  within  the  parallelogram  constitute 
one  communication  cell.  Such  cells  are  repeated 
throughout  the  DSN.  With  rather  minimal  local 
coordination  the  pattern  will  effectively  avoid 
communication  conflicts.  The  pattern  is  repeat¬ 
ed  once  every  second.  As  shown  the  sequence  of 
broadcasts  is  1A...1F,  2A...2F,  3A...3F.  **, 

5A _ 5F.  Such  a  pattern  can  be  used  to  move 

data  towards  the  sector  nodes  or  away  from'  the 
sector  nodes. 

To  see  the  flow  away  from  the  sector  node 
consider  the  cycle  starting  with  1  and  proceed¬ 
ing  5A...5F,  1A...1F,  2A...2F.  This  can  be  used 
to  move  data  away  from  one  sector  node  toward 
another  in  case  the  first  sector  node  is  non¬ 
functional,  or  has  lost  touch  with  the  manned 
computer  center.  3ecause  it  is  capable  of  mov¬ 
ing  data  away  from  sector  nodes,  the  report  cir¬ 
cuit  is  also  used  to  broadcast  control  messages 
originating  at  manned  computer  centers  and  rout¬ 
ed  through  the  sector  nodes. 

The  sequence  1A...1F,  2A...2F,  3A...3F, 
5A...JF  (b  is  not  used)  obviously  can  be  used  to 
move  data  towards  the  sector  node.  Since  the 
cycle  is  repeated  once  each  second  reports  from 
the  more  distant  nodes  in  the  sector  will  t#  re¬ 
freshed  eacn  second. 

It  should  be  noted  that  this  communication 
sc.neme  assumes  that  the  DSN  can  control  the  time 
of  a  broadcast.  Also  the  flew  over  multiple 
hops  is  net  accomplished  strictly  iy  multi  nco 
communication  paths  under  control  of  the  oemmur.- 
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icatlon  ays tea.  The  DSN  computers  have  access 
to  messages  in  transit  and  modify  these  messages 
before  they  are  retransmitted.  For  example,  in 
moving  data  toward  the  sector  node  the  node  at 
5C  in  the  pattern  has  probably  heard  1C,  36,  and 
3C  with  enough  lead  time  to  Incorporate  their 
information  in  the  packet  it  will  transmit. 


VTII-  SENSORS  AND  SENSOR  DATA  PROCESSING. 

As  noted  in  several  other  sections  of  this 
paper  each  strawtaan  DSN  nods  includes  a  small 
acoustic  array  and  a  small  radar  In  the  fol¬ 
lowing  brief  paragraphs  we  summarize  the  general 
strawman  sensor  characteristics  and  processing 
load  required  for  those  sensors.  These  sensors 
and  loads  are  only  representative.  This  is  par¬ 
ticularly  true  of  the  radar  option  for  which 
there  is  a  need  for  considerable  design  and 
analysis  work  before  a  sensor  can  be  proposed 
with  much  confidence.  Low  flying  targets  in 
general  and  cruise  missiles  in  particular  do 
present  difficult  sensor  problems.  More  straw- 
man  sensor  and  sensor  performance  details  will 
be  found  in  the  papers  "Single  Node  Detection 
and  Target  Parameter  Estimation”  by  P.  Demko  and 
"Acoustic  Sensor  Capabilities  and  Performance" 
by  T.  Landers  which  appear  elsewhere  in  this 
workshop  proceedings. 

Each  node  in  the  strawman  design  includes 
an  array  of  10  high  quality  microphones  deployed 
over  a  plane  aperture  of  3  meters.  Digital  data 
are  collected  at  the  rate  of  2000  samples  per 
second  per  channel.  Once  every  two  seconds  a  one 
second  sample  of  array  data  are  processed  to 
detect  targets  and  estimate  the  direction  from 
which  sound  is  commlng  at  a  large  number  of  dif¬ 
ferent  frequencies.  Alternate  one  second  inter¬ 
vals  of  data  are  not  collected  and  the  time  may 
be  used  from  sensor  calibration  or  other  pur¬ 
poses.  Single  node  target  detection  and  direc¬ 
tion  estimation  makes  use  of  high  resolution 
frequency-wavenumber  signal  processing.  Detec¬ 
tion  and  estimation  depend  upon  finding  in¬ 
creases  in  the  amount  of  power  which  appears  to 
arrive  at  the  array  as  a  function  of  direction 
and  frequency.  The  single  node  processing  load 
amounts  to  about  11  million  real  adds  or  multi¬ 
plies  per  second. 

A  small  monostatlc,  coherent  pulsed 
doppler,  two  dimensional  radar  has  been  tenta¬ 
tively  selected  as  the  strawman  DSN  radar.  The 
strawman  radar  at  this  time  is  not  a  serious 
proposal  for  a  specific  radar  but  rather  is  a 
somewnat  arbitrary  selection  of  some  possible 
radar  characteristics  for  the  purpose  of  sizing 
computational  requirements.  The  radar  frequency 
and  other  characteristics  are  subject  to  sub¬ 
stantial  changes  as  a  result  of  further  study. 
Even  the  type  of  radar  may  be  changed  but  it  is 
Liweiy  that  our  present  selection  would  be  the 
most  computationally  demanding  alternative  so  it 
has  been  selected  as  worst  case  starting  point. 
The  strawman  radar  is  L  band  (1.3  GHz).  It  uses 
a  'D  kHz  pulse  repetition  rate  to  operate  out  to 


about  12.5  km  ranges.  The  antenna  has  about  10 
degree  azimuth  resolution  and  30  degree  eleva¬ 
tion  beamwidth.  The  radar  scans  360  degrees  us¬ 
ing  36  beams  displaced  from  eacnother  by  10  de¬ 
grees.  Nine  beam  directions  are  used  each 
second.  State  of  the  art  digital  processing  is 
'used  to  detect  targets  in  radar  ground  clutter 
and  to  make  most  effective  use  of  doppler  shifts 
resulting  from  target  motion.  Targets  are  lo¬ 
cated  within  120  meter  range  cells  but  with  poor 
direction  information.  The  processing  load, 
about  half  of  which  is  FFT  amounts  to  25  million 
read  adds  or  multiplies  per  second  for.  a  single 
such  radar. 


H2J2£  BABamg  CONFIGURATION . 

The  hardware  configuration  for  each  node  in 
the  Version  1A  DSN  is  shown  in  Figure  7.  Nei¬ 
ther  the  sensor  nor  the  packet  radio  will  be 
discussed  further  in  this  section.  The  other 
electronic  hardware  is  assumed  to  be  built  with 
currently  existing  integrated  circuits  but 
higher  level  elements  of  the  system  are  not  now 
cosmiercially  available.  Ue  have  specified  this 
hardware  to  avoid  compromising  system  perfor¬ 
mance  due  to  the  use  of  slower  commercial  equip¬ 
ment.  The  specification  of  considerable  comput¬ 
er  hardware  at  each  node  will  give  the  most 
flexibility  for  software  options.  Even  more 
computer  power  would  be  technically  feasible  but 
no  well  defined  requirement  has  yet  developed. 
The  option  shown  is  sufficient  Tor  the  worst 
case  radar  and  acoustic  signal  processing  at  a 
node  with  enough  spare  power  for  other  DSN  func¬ 
tions.  Ongoing  design,  simulation,  and  experi¬ 
mental  efforta  will  be  used  to  determine  if  the 
processing  capability  represented  by  Figure  7 
will  actually  be  required. 

Except  for  a  small  amount  of  sensor  specif¬ 
ic  and  interface  electronics  the  electronics 
consist  of  a  main  computer,  memory,  and  single 
instruction  multiple  data  stream  processor 
(SIMD)  which  could  satisfy  all  known  processing 
requirements  for  a  node.  The  outline  of  a 
design  for  the  main  electronics  has  been  com¬ 
pleted  and  the  numbers  of  integrated  circuits 
required  for  each  electronic  subsystem  has  been 
roughly  estimated.  For  the  estimates  obtained 
and  shown  on  the  Figure  the  cost  of  the  entire 
node  built  to  MIL  SPEC  would  be  $120,000  if  we 
assume  that  system  cost  is  propostional  to  the 
number  of  itegrated  circuits  and  the  system  cost 
per  circuit  is  $60.  The  $60  figure  is  reason¬ 
able  at  the  present  time  and  will  probably  de¬ 
crease  by  a  substantial  percentage,  between  30S 
and  501,  each  year  in  the  immediate  future.  The 
design  and  these  estimates  do  not  make  use  of 
special  developments  in  very  large  scale  in¬ 
tegration  or  of  special  custom  chips.  Such  al¬ 
ternatives  could  substantially  reduce  cost, 
size,  and  power  consumption. 

The  main  processor  serves  for  general  pur¬ 
pose  computing  and  also  as  an  input/cutout  chan¬ 
nel  processor  for  main  memory.  In  its  role  as  a 
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general  purpose  processor  it  will  perform  all 
processing  at  the  node  which  is  not  mors  ap¬ 
propriately  done  by  the  faster,  essentially 
parallel,  SXMD  processor.  This  could  include  a 
substantial  amount  of  sensory  A1  processing. 
Such  sensory  AI  would  be  to  make  more  effective 
use  of  multidimentional  power  maps  produced  by 
single  sensor  signal  processing  as  the  first 
step  toward  target  detection  and  parameter  esti¬ 
mation.  The  strawman  design  has  the  main  pro¬ 
cessor  running  an  extension  of  the  PDP-11  in¬ 
struction  set  so  that  it  can  emulate  the  PDP-11 
with  high  efficiency.  The  main  differences 
would  be  the  following  two.  First,  instruction 
lengths  are  not  the  same:  the  instructions  have 
been  reformated  to  allow  32  bit  addressing,  32 
registers,  etc.  Second,  Instructions  must  be 
treated  as  read  only  because  the  main  processor 
design  includes  an  Instruction  cache  which  is 
not  updated  by  data  writes. 

The  main  processor,  although  it  1s  20  times 
faster  than  typical  commercial  minicomputers  in 
tight  loop  arithmetic  computations,  is  still  not 
fast  enough  to  handle  the  several  tens  of  mil¬ 
lions  of  arithmetic  operations  per  second  re¬ 
quired  for  full  acoustic  and  radar  data  process¬ 
ing.  For  this  purpose  a  special  arithmetic  pro¬ 
cessor  is  employed.  This  processor  is  a  10  ele¬ 
ment  3IMD:  a  single  instruction  stream,  multiple 
data  stream  processor  with  10  data  units  and  one 
instruction  execution  unit.  Both  acoustic  and 
radar  processing  at  a  single  node  naturally 
decompose  to  be  efficiently  accomplished  using 
3uch  an  architecture.  The  SIMD  design  takes  ad¬ 
vantage  of  currently  available  high  speed  1 6x16 
multiplier  chips. 

Almost  all  circuitry  is  intended  to  be  low 
power.  Micro-cycle  times  are  250  nanoseconds, 
and  only  a  little  processing  is  done  in  each 
micro-cycle.  For  example,  single  precision 
floating  point  adds  are  done  in  750  nanoseconds 
in  the  main  computer,  whereas  Schottky  circuitry 
would  do  them  in  only  200  nanoseconds.  The  main 
exception  to  low  power  are  16x16  multiplier 
chips  to  be  used  by  the  SXMD  processor.  Each  of 
those  chips  consume  several  watts  of  power. 

The  processor  briefly  outlined  here  is  not 
presented  as  the  only  practical  option  for  the 
node  electronics.  We  have  pursued  a  specific 
option  as  part  of  our  strawman  design  only  to  be 
able  to  put  on  display  one  option  which  is 
feasible  and  will  satisfy  our  reouirements. 
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Fig.  1.  The  three  basic  DSN  deployment  configu 
ration. 
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AREA  GEOMETRY  SENSOR  CIRCUIT 
INFORMATION  FLOW  FROM  A  NOOE  TO  NEIGHBORS 

Tig.  3.  Nominal  single  hop  receptions  of  a  node 
sensor  report. 
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"ig.  - ■  Surveillance  sectors  and  node  areas  of 
interest  with  all  sector  nodes  opera¬ 
tional  . 
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Fig.  5.  Redefinition  of  surveillance  sectors 
when  a  sector  node  has  failed. 


•  OSN  NODE  (*  EQUIDISTANT  BETWEEN  SECTOR  NODES) 

—  COMMUNICATION  PATTERN  CELL 

—  ENCLOSES  NOOES  REPORTING  TO  THIS  SECTOR 
NOOE  AS  CLOSEST  SECTOR  NOOE 
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REPORTING  PATTERN  AND  FLOW 
Fig.  6.  Report  circuit  broadcast  pattern. 
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DISTRIBUTED  SENSORS  NETWORKS  (DSN): 
AN  ATTEMPT  TO  DEFINE  THE  ISSUES 
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!■  INTRODUCTION 

This  paper  iraws  upon  ideas  from  a  number  of 
sources  and  persons,  in  particular,  lengthy 
discussions  with  J.  Barnett  and  D.  Cohen,  whose 
invaluable  help  I  wish  to  acknowledge.  However,  I 
oear  tne  sole  responsibility  for  the  opinions 
exnressed.  the  errors  made,  and  tne  final  fora  of 
tne  paoer. 


1  .3  Tih  ‘eetlva  of  thl a  oaoer 


It  seems  appropriate  at  this  early  stage  of 
research  to  take  a  global  view  of  tne  DSN  problem. 
This  caper  presents  an  Initial  attempt  to  obtain 
inis  goal;  it  should  be  read  as  a  working  paper 
wnose  pain  purpose  is  to  trigger  responses  and 
focus  attention  on  some  of  tne  Issues. 

The  aethod  we  have  chosen  to  nelp  define  the  issues 
tenters  on  identifying  the  potential  functions  of 
DSN.  These  issues  include  intelligent  integrated 
signal  processing  architecture,  developing 
architecture  to  support  decentralized  cooperative 
processing,  and  overall  design  problems. 

Finally,  the  development  of  DSN,  integrating  three 
major  technologies  (i.e.,  sensor, 
computer-communication,  and  information-processing 
technologies) ,  will  require  expertise  in  aany 
fields.  We  nave  compiled  an  annotated  bibliography 
of  some  xey  papers,  survey  papers  and  books  that 
may  help  the  reader  to  gain  insignts  into  the  scope 
and  focus  of  some  relevant  technologies. 

1*1  don.  laaiaaigfflu 


3y  a  feature- set  we  mean  a  set  whose  elements 
represent  possible  values  of  some  target  parameter 
e.g.,  position,  heading,  type  of  aircraft, 
etc.).  A  target  feature  is  a  time  function  assuming 
values  in  some  feature-set  (e.g.,  the  track  pursued 
oy  a  target,  the  velocity  of  a  target,  the  spectral 
formants  o'  the  noise  emitted  Py  a  target,  etc.).  A 
• arzet  ip  an  oojeot  consisting  of  a  3et  of  features 
and  certain  relationships  between  them.  A  aer.sor 
is  a  ievloe  that  can  observe  a  function,  to  oe 
oi. led  oeservatlon.  of  some  target-features.  The 
observation  lepenos,  in  addition  to  tne  feature 


observed,  upon  two  sets  of  parameters:  lntri n si c 
sensor's  parameters  (e.g.,  the  object  of  tne 
sensor's  observation,  the  rate  of  sampling, 
internal  noise,  etc.)  and  extrinsic  parameters, 
resulting  from  the  interaction  between  tne 
observable  feature  and  the  sensor's  environment 
(e.g.,  limited  field  of  view,  weather  conditions, 
etc.) . 

A  DSN  consists  of  a  set  of  sensors,  a  communication 
network,  and  a  set  of  processing  elements.  The 
sensors  may  observe  targets  in  a  given  area;  the 
information  collected  by  the  sensors  is  than 
communicated  to  elements  tnac  process  the 
observations  and  communicate  between  themselves  to 
extract  target-features.  Note  that  we  classified 
the  network  elements  according  to  functions; 
traditionally,  a  few  of  those  functions  are 
sometimes  incorporated  into  a  single  device,  the 
borderline  between  functions  not  always  clearly 
marked. 

I2S1L.  ao.  lafsrancB-dBrivatlen  ay  a  tea. 

A  DSN  pay  be  thought  of  as  a  large 
inference-derivation  system.  The  process  of 
inferring  is  based  upon  some  prior  knowledge  about 
relations  between  target-features  and  cbservaticr.3; 
it  may  consist  of  both  logical  and  statistical 
derivations  and  foilow,  typically,  the  Bayesian 
Inference  paradigm.  That  is,  the  ooserved  data  is 
employed  to  improve  apriorl  knowledge  of  targets. 
The  inferred  posterior  knowledge  serves  as  tne  new 
prior  knowledge,  on  the  oasis  of  wnicn  new 
observations  are  to  Oe  explained.  Accordingly,  tne 
state  of  the  inference  process  at  each  moment 
consists  of  the  set  of  all  present  beliefs  that  it 
holds.  This  state  is  modified  as  new  observations 
or  inferences  are  made. 


Loosely  speaking,  the 

intelligence 

of  1 

OSH 

consists  of  its  ability 

to  recognize, 

learn, 

ana 

use  relationships  between  features 
Inferences . 

tc 

rive 

As  an  example  oensider 

a  DSN  that  0 

oserves 

tne 

aooustio  emissions  ind  radar  reflections  from  5  set 
of  low-flying  aircrafts.  .At  the  lowest  level  of 
recognition,  the  DSN  tries  to  infer  target 
parameters  suoh  as  azimuths,  ranges,  and  velocities 
from  the  observations.  Dnoe  these  higher-level 
features  are  extracted,  the  DSN  needs  to  relate  tne 
parameters  extracted  from  tne  accustio  nci3e  ant 
tnose  extracted  from  radar  reflection.  The  next 
steo  nay  Oe  to  compare  the  present  estimation  to 
Information  acquired  in  tne  past  ir.  orqer  -0 


extract  features  of  the  tracks  pursued  By  the 
targets.  At  still  a  higher  level  of  intelligence, 
it  may  be  possible  to  extract  information  about  a 
possible  enemy-plan  from  a  set  of  target  tracks  and 
their  composition. 

Although  nigher-level  features  aay  be  inferred  from 
combinations  of  lower  level  features,  the  t nfarenee 

aracaaa  1a  oat  aniiUacUaial -  in  fact, 

higner-levei  recognition  of  features  can  be  fed 
back  to  lower-level  inference  derivations  to 
improve  their  prior  information.  This  typical 
Bayesian  approach  to  inference  can  serve  to  cut 
down  the  comolnatorlally  explosive  size  of 
processing  at  the  lower-levels.  For  Instance,  in 
an  acoustic  array  tne  spatially  distributed 
reservations  of  acoustic  emissions  are  temporally 
and  spatially  correlated.  The  correlation  function 
is  analyzed  speccrally  to  produce  a  map  of  the 
power  spectrum  as  a  function  of  the  vector  wave 
number.  The  azimuth  to  the  source  of  emission  is 
found  by  a  maximum  likelihood  analysis  of  the  map. 
The  process  of  mapping  the  power  spectrum  in  the 
wave-number  plane  involves  an  enormous  amount  of 
computation.  The  amount  of  computation  aay  be 
radically  cut  down  if  the  algorithm  had  some  prior 
"clues"  on  the  nature  of  the  source  (l.e.,  expected 
direction  of  arrival,  expected  speed,  spectral 
signature)  from  some  higher-level  recognition 
process.  Sucrt  clues  can  be  used  to  reduce  the 
range  of  wave  numbers  to  be  searched. 


The  two  major  problems  of  tne  DSN  project  are 


1.  To  develop  integrated  inference  systems 
to  extract  target-features  from 
distributed  observations. 

2.  To  develop  decentralized  architecture  and 
algorithms  to  support  the  inference 
system. 

la  what  follows  we  shall  consider  some  of  the 
issues  arising  from  the  two  problems.  However, 
before  proceeding  to  a  detailed  discussion  of  the 
issues,  let  us  comment  on  the  significance  of  the 
DSN  development  effort  to  other  related 
technologies . 


XA  DSN,  what  . 


It  is  premature  to  try  to  answer  this  question 
accurately.  Nevertneless  it  is  important  to  keep 
in  mind  some  possible  scenarios  of  military 
information  processing  in  the  next  decade. 
Therefore,  rather  than  attempting  to  answer  the 
question,  we  will  present  some  possible 
technological  problems  to  which  the  DSN  research 
effort  aay  be  highly  relevant. 


let  us  consider  some  of  the  results  of  distributing 
DSN  components  (l.e-,  sensors  and  processors).  The 
purpose  of  the  distribution  is  twofold.  First, 
distributing  sensing,  processing,  and  communication 
capabilities  obtains  a  higher  degree  of  reliable 
coverage  and  survivability.  Second,  the  function 
of  sensing  may  oest  be  served  by  distributing  the 
sensing  power.  Indeed,  since  a  sensor's 
capabilities  may  depend  upon  its  relative  location 
witn  respect  to  tne  target,  distributing  ths 
sensing  capabilities  augments  the  area  covered  by 
any  3ingle  sensor.  This  is  particularly  true  of 
target  scenarios  in  which  the  targets  try  to  avoid 
detection  !e.g.,  low-flying  aircraft,  silent 
submarines).  In  addition,  using  a  variety  of 
sensors  can  provide  an  effective  corroboration 
mechanism  Between  different  observations. 


The  distribution  of  the  observation  process  implies 
a  distribution  of  the  processing  and  a  suitable 
communication  mechanism  between  the  different 
parts,  posing  difficult  architectural  problems. 
How  should  we  organize  tne  network?  How  should  we 
allocate  functions,  responsibilities  and  resources? 
How  snould  we  establish  useful  communication 
mechanisms?  Me  call  these  organizational  issues 
problems  of  decent  rail  zed  architecture. 


We  3nail  reserve  the  word  distributed  to  denote 
spatial  distribution,  wnile  decentral iced  will 
denote  a  mode  of  organization  and  operation  wnere 
no  single  process  possesses  the  power  and  knowledge 
to  monitor,  control,  and  coordinate  the  activities 
of  all  other  processes. 


In  the  mid-1980s  military  effectiveness  will  depend 
upon  rapid  collection,  interpretation,  evaluation, 
and  dissemination  of  information.  3y  that  time  a 
major  portion  of  all  strategic  and  tactical 
information  will  be  computer-prooessable,  and 
automated  Information  processing  will  replace 
traditional  manual  methods.  Acnieving  an 
integrated  information  system  encompassing  the  full 
spectrum  of  command,  control,  and  communication 
(C3)  functions  will  require  a  major  architecture 
qeveiopment  effort  in  order  to  integrate  tne 
Corresponding  c3  technologies. 

The  Distributed  Sensor  Networks  (DSN)  research  will 
Contribute  to  the  development  of  integrated  c3 
information  systems  ia  two  forms:  first,  smart 
sensor  networks  will  form  an  essential  part  of 
future  Integrated  military  information  systems,  and 
second,  DSN  offer  a  microcosm  bf  tne  future 
operation  of  integrated  c3  systems. 

Problems  of  developing  real-time, 
funetlon-oriented,  high-level  protocols,  piecing 
together  partial  information  to  extract 
intelligence  and  feeding  tnis  intelligence  back  to 
alleviate  computational  processes,  designing 
software  for  decentralized  large-scale  processing, 
and  Integrating  communication  and  processing  t; 
support  each  other  are  not  unique  to  DSN,  but  are 
an  essential  part  of  C3  systems  architecture.  It 
is  in  this  sense  that  the  DSN  project  transcends 
its  immediate  goals  and  C3n  be  expected  to 
contribute  to  the  development  of  oonceots, 
methodologies,  and  solutions  to  general  oronlems  tf 
Z-  systems  architecture. 
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2.  3SN  FUNCTIONS. 

Tne  first  issue  to  be  addressed  is:  What 
specifically  do  we  expect  DSN  to  do  for  us?  The 
DSN  goal  is  to  extract  information  about  targets 
from  observables  detected  by  a  set  of  sensors  and 
then  to  report  it.  This  goal  needs  to  be  defined 
more  accurately. 

The  description  of  the  goal  of  a  DSN  must  consist 
of  two  parts.  First,  it  is  necessary  to  describe 
tne  environment  in  wnicn  the  DSN  will  operate. 
What  type  of  targets  are  expected?  What  are  the 
observables  produced  by  those  targets  and  the 
relationships  between  them?  What  do  the  sensors 
observe?  What  are  the  characteristics  of  the 
geographical  environment  in  which  the  DSN  is  to 
operate?  and  so  on. 

Second,  it  is  required  to  describe  the  type  of 
information  that  we  wish  the  DSN  to  deliver  and  the 
mode  of  reporting.  What  are  the  features  that  we 
wlsn  to  be  reported  by  the  DSN?  Where  snould  the 
reports  be  delivered?  When  should  features  be 
reported?  How  should  an  interactive  mode  of 
reporting  be  establisned  (l.a.,  to  permit  human 
operators  to  request  specific  types  of  reports,  or 
to  focus  the  attention  of  the  DSN  on  features  of 
particular  interest)? 

In  addition,  a  performance-evaluation  scheme  should 
oe  establisned  to  measure  the  merit  of  the  DSN. 
What  is  it  precisely  that  we  wlsn  to  optimize  in 
the  "quality"  of  reports?  Is  it  possible  to  direct 
the  DSN  through  an  interactive  process  to  adapt  to 
different  external  performance  objectives  (say,  as 
defined  ay  a  human  operator)? 
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Work  3heuld  be  carried  out  to  abstract  and  define 
preciseiy  the  boundary  of  interaction  between  DSN 
and  human  operators,  to  establisn  agreed  linguistic 
primitives  for  proper  interaction,  to  define  the 
possible  performance  objectives  of  DSN,  and  so  on. 


Dnae  tne  expected  functions  of  OSN  are  weil 
understood,  we  can  proceed  to  develop  the  system  to 
provide  those  functions.  The  problems  that  we  face 
consist  (as  noted  in  the  previous  section)  of  two 
classes:  oroolems  of  drawing  inferences  from 
distributed  observations  and  problems  of  organizing 
decentralized  architecture.  In  the  following 
sections  we  consider  the  two  classes  of  problems 
more  carefully. 
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In  this  section  we  snail  restrict  our  sttention  to 
me  "03"  part  of  DSN:  Namely,  what  are  the  Issues 
in  developing  intelligent  processing  of  distributed 
observations  from  a  variety  of  sensors?  The 
constraints  of  decentralized  processing  of 
observations  will  be  considered  in  tne  next 
section. 


DSN  are  large  integrated  signal -processing  systems 
wnose  main  differences  from  classical  sensor 
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systems,  as  far  as  signai-prooassing  is  concerned, 
are  the  Level  of  intelligence  that  we  expect  to 
extract  from  signals  and  the  intelligence  that  is 
required  to  produce  this  information.  Intelligence 
is  the  answer  to  the  challenge  of  tracking  targets 
that  can  not  be  fully  observed.  That  is,  in  the 
aosence  of  full  observation  of  the  targets,  we  nave 
to  use  Intelligence  to  piece  together  the  little 
information  that  we  may  have.  what  are  tne 
important  issues  fcfl  Ha  addressed  ia  jBYClCDiflg 
aronitacture  support  I  nr.al  1  <  gent 
inference-derivation  aYatBBB? 

To  answer  the  question  posed  aoove  we  should 
digress  on  the  problem  of  inferring  from 
observations.  Pernaps  the  beat  way  to  proceed  is 
by  considering  a  detailed  example.  We  shall 
consider  a  DSN  using  acoustic  sensors  in  order  to 
ground  the  issues  in  a  specific  context.  However, 
the  Issues  raised  are  by  no  means  restricted  to  the 
example  and  can  be  easily  generalized  to  other  DSN 
scenarios. 


lei.  Spatially  distributed  reservations 


^■1  -1  Spatially  distributed  aSMCaUlB*- 

Consider  the  case  of  an  aircraft  emitting  acoustic 
noise.  The  level  of  noise  and  its  spectral 
composition  depend  upon  intrinsic  properties  of  the 
sources  of  emission  te.g.,  engine  type  and  rpms). 
The  Boise  emitted  is  modified  by  the  environment  of 
the  source  (e.g.,  atmospheric  conditions, 
geomorphology)'  to  induce  a  rieid  of  acoustic  noise 
over  a  certain  region.  This  acoustic  field  is  a 
typical  spatially  distributed  observable. 

T,.1 .2  QtaaBrYatigna., 

The  acoustic  field  may  be  sampled  by  a  microphone. 
The  sampled  wave  provides  locaj,  information  about 
the  observable.  The  local  sample  may  be 
statistically  analyzed  to  extract  some  target 
features.  Specifically, 

1.  It  1s  possible  to  test  the  hypothesis 

that  the  observed  sample  has  been 

produced  by  a  target  against  the 
hypothesis  that  it  is  sn  environmental 
noise. 

2.  It  is  possible  to  estimate  tne  speed  of  a 
moving  target  from  the  Doppler  shift  of 
frequencies. 

3.  It  is  possible  to  dlassify  the  target 
from  its  spectral  signature. 

u.  It  is  possible  to  estimate  tne  azimuth  to 
the  target  using  a  directional  microphone. 


The  possibility  and  accuracy  of  tne  aoove 
inferences  depend  upon  two  factors:  t.he  quality  of 
ooservatlons  and  tne  scpnisticaticn  of  tne 
inferring  processes. 


•  •  .  “  *  * 


’-'A  A* 
%* A  A 


V  A 


x 


rcr  instance,  tasting  wnether  a  target  exists 
against  oackgrcund  noises  depends  upon  the  level  of 
oackgrcund  noise  ,a.g.,  strong  winds  may  sound  iike 
targets).  However,  a  sopnisticated  process  could 
-earn  aoout  characteristics  of  tne  environmental 
noise  fros  past  ooservations  and  use  this 
'-nteil lgenoe  to  cBtain  sore  accurate  resolution  of 
nypotnesis.  for  instance,  it  nay  oe  possible  to 
learr.  aoout  tne  spectral  characteristics  of  tne 
wins  and  identify  new  spectral  components  belonging 
to  ctr.er  sources,  as  the  sources  appear.  How 
snow,.  i  inference  algorithms  jaaffl  Xdl  cast 
coservat lens  ?  How  could  the  information  learned  ne 
ised? 


processing.  for  instance,  in  an  array  of 
aicropnones,  tne  extraction  of  azimuth  to  the 
30 urea  requires  that  the  sampled  observations  of 
the  acoustic  wave  be  correlated.  Thi3  process 
requires  a  wide  communication  bandwidth  between 
eacn  ooserver  and  the  processing  algorithm.  On  the 
otner  .hand,  wnen  the  information  to  oe  processed 
consists  of  a  small  number  of  azimuth  measurements 
(from  a  few  arrays),  the  communication  oandwidtn 
demand  is  small .  What  are  the  suitable 
organizational  aLClEUlESa  223.  ia  aSfiL 
space-time  coordination  rgauirSit  UX  122  Inferring 
processes? 


1 ■  ■  ■  -  i  Inferring  fcaa  spatially  distributed 
Cbservations. 

In  - ne  previous  section  we  considered  point 
ooservations  of  acoustic  emissions.  The  main 
problem  to  oe  ipproacned  in  this  section  is:  What 
2ia  12,  inferred  from  2  discrete  a fit  5Satl2llY 

list. -lout ed  samples’ 

first,  it  may  oe  possible  to  employ  spatially 
ilstriouted  observations  to  improve  the 
signai-to-noise  ratio  (SNR).  In  other  words,  oy 
aoDiying  averaging  processes  to  tne  set  of  point 
ooservations,  it  is  possible  to  reduce  uncorrelated 
noises  (Just  another  application  of  the  laws  of 
large  numoers).  The  noises  to  Oe  reduced  are  those 
produced  oy  tne  environment  and  intrinsic  sensor 
noises.  The  internal  noises  of  differsnt  sensors 
can  oe  wept  uncorreiated  oy  using  standard  methods. 
The  environment  is  more  difficult  to  handle.  If 
tne  sensors  are  far  enough  apart,  the  environmental 
noi3es  are  uncorrelated.  However,  if  tne  sensors 
are  close  to  eacn  other,  tne  environmental  noise 
may  be  nignly  correlated  and  an  averaging  process 
will  nave  an  adverse  effect  on  noise  Level  (e.g., 
tne  noise  of  wind  in  a  tightly  packed  array  of 
mioropnenes  is  a  typical  example).  It  Is  possible 
to  use  better  filtering  processes  to  reduce 
C'rrelated  noise.  However,  tnis  requires  that  we 
have  information  aoout  tne  nature  of  the  noise, 
possibly  also  requiring  a  filter  that  learns  and 
adapts  to  the  noise.  In  addition,  a  sophisticated 
r.cise  filtering  may  be  computationally  expensive. 

Second,  it  is  possible  to  use  the  distributed 
ooservations  to  infer  spatial  properties  of  the 
noise  source.  for  instance,  oy  correlating 

3oatlally  distributed  observations  of  an  acoustic 
wave,  it  is  posslole  to  infer  the  direction  of 
prooagation  and  thus  tne  azimuth  to  tne  target. 

Suon  a  oaoaoillty  requires  mat  tne  processing 
algoritnm  oe  aware  of  tne  time  and  location  of  the 
observations  to  within  a  high  accuracy.  The 
prcolem  of  coordinating  tne  time-space  dimensions 
of  tne  Observations  vis-a-vis  tne  ' lme-space 

dimensions  of  tne  observaoles  is  a  key 

organizational  prcolem.  The  overriding  factor  in 
obtaining  a  suitable  space-time  synchronization  is 
tne  3ceed  of  ocmmunloat ion .  The  correlation  of 
3cme  ooservations  requires  that  a  large  volume  of 
information  oe  accurately  communicated  at  a  nign 
3oeed  to  meet  me  -equirement  of  real-time 


1*2.  Architecture  22  support  rtal.-t.ias 
distributed  mfe.-enclng. 

1*2*1  Hierarchical  extraction  InfCraaUca—  US 
"two-way  aioellne". 

As  described  in  the  introduction,  the  Bayesian 
paradigm  of  statistical  derivation  of  inferences 
consists  of  improving  the  state  of  knowledge 
through  iterative  feedback.  This  paradigm  implies 
a  "two-way  pipeline"  inference  architecture. 
Observations  are  first  processed  locally  to  extract 
some  target-features.  The  Information  derived  may 
be  delivered  to  a  higner-level  processing  algoritnm 
that  combines  tne  pieces  of  information  to  extract 
higner-level  associations  to  oe  delivered  to  r.igner 
processing  levels,  and  so  on.  The  upstream 
pipeline  is  an  information  compressor  turning 
billions  of  observational  bits  into  a  small  report 
to  Oe  delivered  to  the  user. 

The  downstream  pipeline  returns  i.nfprmation  aoout 
the  present  state  of  target  information,  to  oe  used 
as  a  prior  knowledge  to  support  lower-level 
processing.  (Note  that  tnis  two-way  pipeline  can 
map  user's  instructions  and  otner  external 
high-level  information  downstream.)  In  addition, 
higher  levels  can  identify  electronic  and  otner 
enemy  counter  measures  and  instruct  lower  levels  to 
respond  accordingly. 

It  may  be  possible  to  carry  cut  a  resource 
management  policy  wnich  adapts  to  tne  information 
availaoie  on  a  target  scenario.  tor  instance,  it 
is  possible  to  focus  tne  attention  of  directional 
sources  to  those  directions  from  whj.cn  targets  are 
expected  to  arrive.  It  is  also  possible  to  turn 
radars  on  and  off  wnen  targets  appear  or  disappear 
or  wnen  they  may  oe  endangered  Oy  smart  Oomos. 

How  should  we  structure  t  he  two-way  r, :  oe  1 :  ne  ? 

Hni  or.  laiacsasaa  au  la  £&  acsaaaaaa  22  1222 a 

level?  How  snould  we  use  tne  inferences  to  acr.ieve 
intelligent  resource  32aaggae.lt  3£i23X2 

1.2.2  Inferring  la  "rgai-t lag" . 

A  major  consideration  in  structuring  a  nierarcr.ical 
two-way  inference  pipeline  is  tne  space-time 
structure  of  tr.e  observations.  The  soatia. 
structure  of  tne  ooomunioat ion  and  processing 
resources  13  an  overriding  factor  in  deciding  ‘he 
allocation  of  functions  and  resDOnsioiiities  among 
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network  .nodes.  ««  snail  consider  f.ne  resulting 
orooJems  ;n  section  u .  However,  tne  requirement 
•cat  processing  snould  ccour  in  real-time  has 
implications  oeyond  questions  of  resource 
management  . 

Different  target-features  nave  different  temporal 
oenavior — some  cnange  rapidly,  otners  remain 
invariant.  me  notion  of  "real-time"  changes  from 
one  feature  to  anotner.  Accordingly,  the 
<nowi edge-oases  and  t.ne  inferring  processes  at  each 
level  in  tne  hierarchy  snould  reflect  the  temporal 
nature  of  the  knowledge. 

Another  aspect  of  tne  "real-time*  processing 
requirement  is  that  old  data  acquired  must  often  Pe 
discarded  to  permit  new  data  to  he  processed.  In 
deciding  wnether  to  discard  data  and  wnicn  data  to 
discard,  the  inference  process  must  evaluate  the 
relative  significance  of  the  data.  The  value  of 
information  ahout  a  target-feature  is  a  function  of 
noth  its  time  and  its  content. 

How  should  the  inference  processes  manage  their 
knowledge-bases  ia,  reflect  i&fi  ISaBcral  nature  2t 
Ud  knowledge  ml  ii  evaluate  significance  s£ 
data  sc  as  to  manage  local  processing  resources 
effectively? 


1-2.1  Communication  between  inferring  nrssBaaea.. 

In  tnis  section  we  discuss  semantic  aspects  of  the 
communication  between  inferring  processes.  Once 
responsibilities  are  allocated  to  the  different 
levels  of  the  two-way  inference  pipeline,  it  is 
necessary  to  provide  for  an  efficient  mechanism  to 
communicate  knowledge  between  the  different  levels. 
The  main  questions  are:  What  la  communicate?  and 

iai  ia  icaauiuaate? 

to  answer  the  first  question  we  should  first 
identify  the  objective  of  communication.  loosely 
soeaking,  the  objective  of  communication  is 
twofold,  to  update  the  knowledge  possessed  by  the 
respective  processes  and  to  direct  their  attention. 
Accordingly,  it  is  required  to  identify  which 
knowledge  may  be  required  by  wnicn  processes.  How 
snouid  that  knowledge  be  represented?  How  should 
one  inference  process  support  the  operation  of 
ethers?  Finally,  how  snould  higner  level  inferring 
processes  dynamically  focus  the  attention  of  the 
lower  levels? 

Anotner  class  of  problems,  arising  from  the 
ilstributad  nature  of  the  inference  processes,  is 
tne  proolem  of  data-assoclation.  How  do  procssses 
tnat  reason  aoout  a  muititarget  scenario  relate  the 
possibly  dlffersnt)  images  of  the  3ama  scenario 
that  they  may  have?  'What  global  view  may  be 

levelopad  and  how  may  it  be  ieveloped? 

These  are  the  major  issues  as  far  as  the  semantics 
of  communicating  inference  processes  is  concerned. 
In  section  4.1  we  snail  consider  the  more  general 
problem  of  communication  in  a  decentralized  system. 


!U  ,,ycaiI£g:'JfiS  12.  support  decentralization. 

In  addition  to  issues  arising  from  the  need  to 
develop  integrated  intelligent  signal-processing 
architecture,  it  is  necessary  to  understand  and 
address  issues  arising  from  the  decentralized 
nature  of  the  participating  processes.  A  line  of 
distinction  snould  be  drawn  between  the  "classical" 
notion  of  "distributed"  architecture  and  cur  needs. 
Our  main  concern  is  to  teach  net ’works  now  to  soLve 
problems  (e.g.,  as  described  in  the  previous 
sections)  together.  We  shall  call  this 
"decentralized"  computation  as  opposed  to  "P  4  V  " 
problems  of  classical  distributed  processing. 

Problems  of  decentralized  cooperation  are  beyond 
the  scope  of  present  day  practices  of 
network-protocol  development.  Our  problem  starts 
at  the  point  where  present  network  protocols 
terminate;  tnat  Is,  once  tne  Issues  of  how  to 
provide  the  best  communication  schemes  are 
resolved,  how  is  the  communication  facility  to  be 
used  to  obtain  optimal  cooperative  problem  solving? 

Our  main  concern  is  to  develop  protocols  fsr 
real-time  decentralized  cooperative  proDlem 
solving.  However,  since  the  Issues  Involved  in 
developing  real-time  communication  protocols  are 
far  from  being  resolved,  a  major  effort  is  required 
to  bridge  the  gap  between  existing  approaches  to 
high-level  protocols  and  tne  communication  needs  of 
DSN. 

Ii.  i 3  necessary  AfltfJdUCS  UBK  types  a£ 

The  function  of  communication  protocols  Is  to  use 
the  communication  resources  effectively  to  provide 
a  suitable  communication  medium  for  application 
processes.  The  application  processes  in  tne  DSN 
environment-  are  distinguished  by  two  important 
characteristics:  first,  they  have  to  process 
real-time  data  in  real-time  and,  second,  they 
require  close  cooperation  in  order  to  proceed  with 
tneir  individual  computations.  These  two 
characteristics  dominate  the  arena  of  DSN  protocol 
development . 

J-i .)  £iiacuan=acigatflil  tugn-iatal  aaaumiaa 

protocols. 

The  main  problem  is  that  we  nave  no  methodology  to 
design  and  implement  "network-computations".  How 
snould  computational  tasks  be  allocated?  How 
snould  tne  communication  mecnanism  be  used  to 
advance  the  computations?  How  should  we  express 
(linguistically  speaking)  communicating  algorithms? 
How  should  we  model  and  design  communicating 
computational  processes?  How  snould  the  protocol 
carry  out  an  efficient  resource  management? 

a-)-2  Problems  zL  real-time  crctocols. 

As  discussed  in  section  3.2.2,  the  characteristic 
feature  of  real-time  algorithsis  is  tnat  tne  lata 
they  process  carries  a  virtual  time  stamp:  it  loses 


its  value  as  time  progresses.  The  objective  of  a 
real-time  protocol  is  not  just  to  deliver  data  but 
to  deliver  as  much  data-value  as  possible  (here  the 
value  nas  to  do  with  both,  the  significance  of  the 
data  for  the  processes  and  the  time  at  wnich  the 
lata  was  produced).  This  objective  implies  that 
sucn  classical  protocol  objectives  as,  1001  reliable 
delivery  are  irrelevant.  There  is  no  point  in 
insisting  on  delivery  of  outdated  messages  and 
tying  up  communication  resources  to  serve  them  at 
the  expense  of  larger  delay  for  recent  data. 
Therefore,  a  real-time  protocol  may  discard 
outdated  messages  and/or  give  priority  service  to 
messages  according  to  their  "value": 

ouon  a  protocol  is  distinguished  from  classical 
protocols  not  only  in  its  objectives  but  also  in 
its  pos3iole  requirements  for  implementation.  For 
instance,  a  classical  principle  of  protocol  design 
i3  tne  principle  of  "transparency".  Namely, 
service  of  messages  ny  low-level  protocols  should 
ce  independent  of  their  contents.  A  real-time 
protocol,  on  tne  other  hand,  may  be  required  to 
provide  service  whose  quality  depends  upon  the 
value  of  the  data  content  of  the  message.  This  in 
turn  implies  tnat  high-level  protocols  need  to  be 
aoie  to  pass  content-dependent  service  instructions 
to  iow-level  protocols  all  the  way  down. 

In  addition,  it  is  required  that  the  application 
processes  can  instruct  the  communication  protocol 
aoout  now  value  snouid  oe  assigned  to  data.  That 
is,  tne  criteria  for  comparing  the  value  of 
different  messages  are  beyond  the  scope  of  the 
protocol . 

Therefore,  the  problems  faced  by  real-time  protocol 
developers  are  substantially  different  from 
existing  practices.  Many  new  concepts  and  methods 
will  nave  to  be  introduced  to  structure  a  good 
real-time  protocol. 

'jraacisatignal  arcBlsaa  si  deaeatrallaed 

architecture. 

Little  i3  known  about  efficient  organizational 
structures  to  perform  tasks.  The  problem  has  been 
qualitatively  studied  by  economists  and  recently  by 
ocntrol-bheoretician3.  However,  the  problem  is  far 
from  being  understood  in  spite  of  its  significance 
to  many  fields  of  knowledge.  The  economists  have 
been  mainly  interested  in  descriptive  models  that, 
due  to  the  complexity  of  human-organizations,  are 
usually  very  crude.  Control  theoreticians,  on  the 
other  hand,  nave  restricted  their  interest  to 
systems  possessing  a  sufficiently  simple 
mathematical  description  and  even  for  those 
systems  tne  problems  of  decentralization  nave  not 
been  adequately  addressed. 

with  tne  advent  of  computer-communication  networks, 
the  problem  of  "programming  networks"  and 
organizing  them  to  perform  real-time  computations 
will  oecoae  significant.  Being  tne  first 
"special-purpose"  computer  networks  for  real-time 
distributed  processing,  DSN  can  Pe  expected  to 
ccntriouti  significantly  to  the  understanding  cf 
organ! zationai  problems  cf  large-scale 
decentralized  systems. 


1  Decentralized  computations. 

How  should  we  organize  a  system  of  "myopic" 
distributed  processes  to  perform  a  cooperative 
global  problem  solving?  How  should  the  processes 
reacn  agreements?  Should  control  flow  between 
processes?  How  3nould  they  coordinate  local 
decisions  to  optimize  glooai  objectives?  How 
snouid  control  flow  between  processes?  How  3nould 
we  secure  global  convergence  of  local  iterations  to 
a  desirable  state? 

a. a. a  Organizational  structures. 

How  should  we  establish  hierarchies  of 
computational  responsibilities?  How  snouid  we 
tailor  the  organizational  structure  to  suit  tne 
nature  of  the  task  to  be  performed  and  the 
availability  of  resources?  How  should  processing 
and  communication  resources  Pe  allocated  to  support 
effective  decentralized  processing?  lihat  are  the 
performance  parameters  and  tne  tradeoffs  !e.g., 
between  communication  and  processing)?  How  oan  a 
large-scale  system  adapt  to  a  global  state  to 
achieve  glooai  objectives  on  the  basis  of  local 
observations  only? 

Problems  of  decentralized  processing  will  demand  a 
major  interdisciplinary  research  effort.  The 
methodologies  developed  to  address  the  specific 
problems  of  DSN  can  be  expected  to  bear  major 
significance  in  the  general  understanding  and 
development  of  function-oriented  organizations  for 
large-scale  computational  systems. 


S.  PROBLEMS  X  MODELLING  AND  DESIGN 

Here  we  nave  a  large  number  of  standard  problems, 
some  of  which  may  have  a  specific  DSN  flavor. 


1.  Establishing  DSN  to  maximize  coverage. 

Topological  optimization  of  sensor 
locations  and  distribution  of  sensor 
power.  Choice  of  sensor  power  to 
optimize  coverage.  Alternative  sensors' 
configurations  and  mixtures. 

2.  Performance  and  design  parameters. 

Identify  critical  performance  and  design 
parameters  and  their  relations. 
Performance  optimization.  Problems  of 
reliable  operation  and  survivability. 


6.  CONCLUSIONS. 

Are  open  for  discussion. 


The  Plolicgrapny  reflects  essentially  my  personal 
taste.  There  nas  oeen  no  attemDt  to  produce  an 
extensive  list  of  references.  My  choice  nas  peer. 


53 


directed  mainly  by  the  need  to  expose  different 
technologies  end  methodologies.  The  answers  to  DSN 
proolems  cannot  be  found  In  tnese  references, 
although  some  fundamental  relevant  Ideas  nay. 


7.1  Signal  processing  tecnnology 
VAN-TREES  H.  L. 

Detection.  Estimation.  and  Modulation 
Theory.  Part  I,  J.  Wiley  A  Sons,  Inc., 
1963. 

The  "Bible”  of  signal  estimation  theory.  Chapters 
2,  3,  parts  of  4  and  3,  contain  an  excellent 

Introduction  to  problems  of  filtering  and 
estimation.  The  second  half  of  part  III  contains 
an  extensive  discussion  of  problems  of  parameter 
estimation  from  radar  returns. 

CAPON  J. 

"High-Sesolution  Frequency-Wavenumber 

Spectrum  Analysis",  Proc.  IEEE.  Vol.  57, 

NO.  3,  August  1969. 

L.ACCSS  a.  T. 

"Review  of  Some  Technique  for  Array 
Processing" ,  in  Exploitation  Seismograph 
Networks.  Ed.  Beauchamp,  NATO  Advanced 
Study  Institutes  Series,  Series  E:  Applied 
Sciences  No.  11,  Nordhoff  -  Leiden  - 
1975. 

3oth  papers  discuss  the  high-resolution  technique 
for  analyzing  the  map  of  the  power-spectrum 
function,  over  the  wave-number  plane.  The 
technique  is  the  one  used  to  extract  source  azimuth 
by  cress-correlating  the  observations  of  the 
acoustic  (or  seismic)  field  as  measured  by  an 
acoustic  array.  The  first  paper  is  highly 
theoretical,  the  second  is  more  practical  and  is 
the  best  starting  point. 

BAfl-iHALCM  Z. 

"Tracking  Methods  in  a  Multitarget 
Environment  ",  issg  AjaaaacUaaa  sa 
Automatic  Control.  Vol.  AC-23,  No.  9, 
August  1973. 

A  good  survey  of  the  problem  with  good  references. 
The  problem  of  Multitarget  Detection  requires 
intelligent  detection  algorithms  that  resolve  the 
problem  of  data-assooiation  (i.e.,  associating  data 
from  sequential  observations) .  This  last  problem 
is  described  and  analyzed  in 

1ITTL2R  A.  W. 

"An  Iptimal  Data  Association  Problem  in 
Surveillance  Theory",  IEEE  Trans.  Mil . 
Electron. .  Vol.  MIL-3,  op.  125-1 39,  April 

’  9b" . 
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Retrospective  View.  Department  of  Computer 
Science,  CMU,  Report  No.  CMU-CS-78-1 17 , 

May,  1978. 

Describes  Hearsay-II,  a  speeon  understanding 
system.  Of  particular  interest  to  the  DSN  project 
is  the  Blackboard  Architecture:  A  communication 

mechanism  between  the  inference  processes  !  eacn 
considered  an  "expert"  in  performing  certain 
inferences),  displaying  tne  present  state  of 
knowledge. 

SMITH  S.  a.  A  DAVIES  R. 

Distributed  Problem  Solving:  Ifca  C intrant. 

Net  Approach.  Computer  Science  Department, 
Stanford  University,  Report  No. 
STAN-CS-7S-647,  September  1973. 

An  approach  to  distributed  problem  solving  is 
described  whereby  processes  compete  for  the 
services  of  each  other  by  bidding.  The  bids  are 
used  by  the  serving  mechanisms  to  estaolish 
priority  service. 

2*3.  Decentralized  cooperative  problem  solving 

7.3.1  I&ft  view  al  economists  ml  decision  .taeBCT 
MARSCHAK  A  RADNER 

Eeonomi  c  Theory  of  Teams .  Cowles  Foundation 
Monograph  22,  Tale  University  Press,  1972. 

The  first  part  of  the  book  is  dedicated  to  an  in 
depth  study  of  the  relation  between  statistical 
decision  theory  and  rational  decision  making  in 
organizations.  The  second  part  is  a  mind-provoking 
study  of  cooperative  statistical  decision  making. 
Still  unsurpassed  in  depth. 

LUCE  A  RAI5FA 

and  Decisions.  J.  Wiley  A  Sons,  Inc., 

1967. 

A  critical  introduction  to  game  theory  and 
statistical  decision  theory.  There  is  an  extensive 
introduction  to  problems  of  group  decision  making, 
culminating  in  chapter  l«,  wnere  some  of  the  major 
dichotomies  are  exposed. 

7.3.2  2H  view  control  Ugg 
IEEE 

Trans .  Auto  ■  Control  ■  Special  Issue  on 
Large-Scale  Systems  and  Decentralized 
Control,  Vol.  AC-23,  No.  2,  April  1973. 

The  survey  article  by  Sandel  etai.  (pp.  :G3-’23; 
is  comprehensive  and  contains  an  extensive 
bibliography  (Attention:  read  the  conclusion  of 
the  survey).  Dther  papers  range  oetween  good  *o 
uninteresting.  The  paper  Py  Hassan  et  al .  '.pp. 
262)  pn  decentralized  computation  of  a  Kalman 
filter  oesrs  some  significance  to  tne  proolsm  cf 
position-tracking.  The  paper  by  Ho  et  s..  ?o. 
•05)  illuminates  some  cf  tne  major  theoretical 
difficulties. 
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'__i  :  J3c.it  er  Connunlcatljn  Set  wonts. 

McQDILCAN  1  CSRF 

"A  Practical  View  of  Computer 
Comiunications  Protocols",  Tutorial,  IEEE 
Computer  Society,  1973. 

The  only  comprehensive  introduction  to  prooiems  of 
protocol  design.  Contains  reproduction  of  many 
important  papers  on  tne  supject.  The  paper  Py 
J.  McQuillan  "The  Evolution  of  Message  Processing 
Techniques  in  tne  ARPA  network"  is  particularly 
recommended. 


Proceedings,  Special  Issue  on  Packet 
Communication  Networks,  Vol.  66,  No.  11, 
Novemoer  1973. 

A  fresn  collection  of  invited  papers  covering  the 
full  3pectrum  of  packet-switched  Computer 
Communication  networks.  The  tutorial  on  protocols 
,pp.  !;Uo)  and  the  paper  on  high-level  protocols 
1*371)  are  recommended  as  a  prelude  to  DSN  protocol 
development.  The  description  of  PRNets  vpp.  1463) 
is  essential  to  the  understanding  of  DSN 
communication. 


KAHN  3.  E. 

"The  Organization  of  Computer  Resources 
into  a  Packet  Radio  Network",  IEEE  Trans. 
Communications.  Vol.  CCM-25,  No.  1,  Jan. 
1977. 

A  well  organized  exposition  of  tne  major  issues  in 
PRNets . 


1*5  Resource  management  la  computer  systems 
KLEINROCK  L. 


Queueing  Systems.  Vol.  II,  J.  Wiley  i 
Sons,  Inc.,  1976. 

The  first  volume  can  serve  as  a  general 
introduction  to  queueing  theory.  The  second  volume 
is  devoted  to  applications.  Chapter  4  describes  a 
menu  of  scheduling  algorithms  and  analyses  of  their 
queueing  performance.  Chapters  5  and  6  contain  an 
introduction,  as  yet  unsurpassed,  to  resource 
management  proDlems  in  computer  communication 
networks. 
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A  general  survey  of  sensor  alternatives  for 
detecting  low  flying  aircraft  and  obtaining  tar¬ 
get  metrics  will  be  presented. 

There  are  two  distinct  kinds  of  sensors. 
They  are  (l)-active,  and  (2)-passive.  Passive 
sensors  measure  radiation  produced  directly  by 
the  target  of  interest.  The  major  radiations  of 
value  are  (l)-sound,  (2)-infared  from  heat 
sources,  and  (3)-altimeter  or  radar  radiations 
from  the  aircraft.  The  primary  active  sensor  of 
interest  is  radar, of  which  there  are  very  many 
alternatives. 

Passive  Sensors. 

Fallowing  are  some  of  the  source  charac¬ 
teristics  which  relate  to  passive  sensors. 
First,  all  sources  have  very  complicated  radia¬ 
tion  patterns.  The  spectrum  and  intensity  of 
radiated  energy  will  vary  very  much  depending 
upon  the  orientation  of  the  observing  sensor 
with  respect  to  the  aircraft.  That  radiation 
pattern  is  also  difficult  to  predict  in  detail 
with  much  certainty.  As  acoustic  or  infared 
sources  aircraft  are  continuous  radiators.  The 
best  a  sensor  can  do  is  to  extract  a  spectrum 
and  estimate  the  direction  from  which  the  radia¬ 
tion  is  comming.  Crosscorrelations  between 
widely  spaced  sensors  could  in  theory  determine 
relative  times  of  arrival  of  signals  at  the  two 
sensors  but  radiation  patterns  3r.d  transmission 
path  effects  tend  to  make  this  impossible.  The 
signals  from  the  aircraft  altimeter  or  radars 
ire  generally  pulses  and  a  sensor  can  determine 
both  direction  and  time  of  arrival.  Of  course 
this  source  is  available  cnly  if  *:.e  aircraft  is 
using  the  radiating  equipment. 

Microphones  can  be  used  to  gather  data  from 
which  spectra  can  be  measured.  The  problem  is 
to  determine  if  the  spectrum  is  from  an  aircraft 
or  is  Just  some  other  noise  in  the  area.  One 
help  for  this  13  to  use  a  form  of  directional 
microphone  or  3mall  array  to  measure  the  direc¬ 
tion  from  wnich  sounds  are  comming.  Much  more  on 
the  detection  procedure  and  the  characteristics 
of  acoustic  sources  and  sensors  will  be  found  in 
the  papers  by  Landers  and  Demko  in  this 
workshop . 

lie  also  note  that  sound  from  a  low  flying 
aircraft  couples  into  the  ground  below  the  air¬ 
craft  to  create  seismic  waves  wr.ich  can  be  meas¬ 
ured  by  seismometers.  In  general  the  detection 
range  for  those  3el3mic  signals  will  be  much 
less  than  the  range  for  acoustic  signals.  How¬ 


ever  seismometers  might  be  of  value  to  identify 
non  aircraft  sources  3uch  as  land  vehicles. 

Infared  sensors  measure  the  amount  of  radi¬ 
ation  which  hits  them  in  the  Infared  region. 
Several  different  band3  can  be  used  to  obtain  a 
simple  signature  of  the  radiation.  Such  sensors 
are  combined  with  optics  to  view  a  very  small 
field  of  view.  This  field  of  view  is  scanned 
and  a  heat  picture  of  the  region  near  the  sensor 
is  obtained.  Aircraft  should  be  local  hot  spots 
in  that  picture.  Thus  passive  infared  can  ob¬ 
tain  a  signature  and  direction.  In  general 
these  can  be  quite  complicated  sensors  and 
eurrect  technology  seems  to  require  cryogenic 
cooling  to  achieve  the  sensitivity  for  aircraft 
at  several  kilometer  ranges. 

Both  acoustics  and  infared  must  contend 
with  transmission  path  effects.  For  infared  the 
transmission  path  effects  primarily  involve  at¬ 
tenuation  and  line  of  sight  limitations.  At¬ 
tenuation  is  not  a  major  issue  for  acoustics  in 
the  frequency  band  of  interest.  Line  of  sight 
is  some  limitation  but  not  nearly  so  serious. 
The  major  acoustic  path  effects  result  from  the 
velocity  structure  of  the  medium  ar.d  are  dis¬ 
cussed  in  detail  by  Landers. 

The  major  problem  for  both  acoustics  and 
infared  is  to  detect  and  identify  the  target  in 
the  presence  of  ail  other  acoustic  or  infared 
noise  sources  in  the  area.  Given  the  ability  to 
do  Chat  they  both  give  spectral  information  and 
direction.  However,  due  to  the  slow  speed  of 
propagation  of  sound,  the  measured  acoustic 
direction  is  where  the  aircraft  was  and  not 
where  it  is. 

Active  Sensors . 

Radar  of  course  operates  by  radiating  a 
signal  which  interacts  with  the  target,  is  rera¬ 
diated  from  the  target,  and  detected  by  the  sen¬ 
sor.  The  r3dar  cross  section  of  a  target  is  a 
measure  of  the  reradiation  cnaracteristcs  of  the 
target.  It  will  fluctuate  wildly  as  a  function 
of  the  orientation  of  aircraft  with  respect  to 
both  the  incident  energy  and  the  detecting  sen¬ 
sor.  The  corresponding  characteristic  for  pas¬ 
sive  sensors  is  the  non  isotropic  radiation  pat¬ 
tern  mentioned  above. 

The  signal  to  noise  ratio,  and  thus  ability 
to  detect,  a  target  in  free  space  depends  upon 
the  amount  of  radiated  power,  the  fourth  rower 
of  the  range  to  the  target  and  the  target's 
cross  section.  However  for  low  flying  aircraft 
there  are  additional  important  considerations. 
In  particular  these  are  terrain  masking, 
clutter,  and  multicathlng.  Terrain  masking  is 
obvious.  Depending  upon  the  topography  ar.c  now 
well  sited  a  raoar  is  tsed  the  line  of  sigr.t 
limit  for  very  low  targets  may  ce  only  a  few 
tens  of  kilometers  or  less.  This  litigates  in 
favor  of  relatively  small  radars.  Two  kir.c3  :f 
clutter  are  cf  interest.  Inforcuratelv  t.ne 
ground  ar.d  other  stationary  things  attacr.et  to 
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it  reradiate  signals  Just  like  real  targets. 
These  signals  are  called  ground  clutter  and  must 
Oe  dealt  with.  Other  clutter  results  from  mov¬ 
ing  objects.  The  most  important  of  these  are 
weather,  trees,  and  birds.  Finally,  the  signal 
reradiated  from  a  low  flying  aircraft  can  skip 
on  the  gound  to  get  to  a  receiver  as  well  as  ar¬ 
rive  oy  the  direct  path.  This  is  called  mul- 
tipathing. 

The  variety  of  radars  which  can  be  con¬ 
structed  is  very  great. 

First  there  are  options  on  the  radar  signal 
shape.  Simple  pulsed  radars  measure  the  time 
delay  from  transmission  to  the  reception  of  the 
reradiated  signal.  Continuous  wave  (CW)  radars 
measure  only  the  frequency  shift  of  the  reradi¬ 
ated  signal  from  moving  targets.  From  thi3 
doppler  measurement  target  velocity  information 
can  be  obtained.  Also,  u3e  of  doppler  is  a 
prime  way  to  operate  in  ground  clutter  with  mov¬ 
ing  targets.  Hybrid  pulsed  doppler  radars  can 
be  used  to  obtain  both  range  and  doppler  infor¬ 
mation.  There  are  many  other  alternatives  but 
these  will  not  be  discussed. 

Second  there  is  the  choise  between  monos¬ 
tatic  or  multistatic.  In  a  monostatic  radar  the 
transmitter  and  receiver  are  colocated.  Thus 
range  measurements  are  the  range  from  the  sensor 
to  tne  target  and  velocity  measurements,  or 
velocity  used  to  determine  if  a  target  is  mov¬ 
ing,  are  the  radial  velocity  with  respect  to  the 
3ensor.  In  multistatic  (or  bistatic  if  there  is 
only  one  receiver)  the  transmitter  is  at  one  po¬ 
sition  and  the  receivers  are  at  a  different  po¬ 
sition.  This  is  not  a  common  mode  for  radars. 
There  can  be  difficult  timing  and  coordination 
problems  for  multistatic  operation. 

Finally  there  is  the  issue  of  directionali¬ 
ty.  Transmission  antennas  and  receiving  anten¬ 
nas  generally  are  quite  directional.  This  tends 
to  conserve  power  and  is  very  helpful  with 
respect  to  clutter.  Traditionally  the  direc¬ 
tionality  13  also  used  to  obtain  directional  in¬ 
formation.  Dnfortunately  good  directionality 
requires  large  antennas  and  is  somewhat  at  odds 
with  the  DSN  concept.  Fortunately  the  use  of 
multiple  range  measurements  for  target  location 
is  generally  superior  to  what  can  be  achieved 
crrm  a  single  directional  radar. 
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AflSXSACX 

Two  acoustic  arrays  10  lan  apart  are  used  to  lo¬ 
cate  a  sub-sonic  aircraft  in  the  horizontal 
plane.  The  arrays  measure  azimuth  of  sound  from 
the  aircraft.  If  the  latter  is  flying  very 
slowly,  it  is  located  at  the  intersection  of 
straight  lines  drawn  from  the  sensors  in  the 
direction  of  the  azimuths  measured  at  the  time 
the  plane  is  to  be  located.  If  the  plane  is 
flying  faster,  the  straight  lines  become  curved 
because  of  the  finite  speed  of  sound,  and  meas¬ 
urements  cade  over  intervals  of  time  must  be 
used  instead  of  measurements  made  at  a  single 
tice.  3ut  the  general  picture  remains  very  much 
the  same.  The  effects  of  azimuth  errors  are 
similar  for  slow  and  fast  aircraft,  though 
geometrical  diagrams  relating  error  magnitude  to 
aircraft  location  distort  as  the  aircraft  velo¬ 
city  changes.  Aircraft  height,  which  causes  no 
error  for  very  slow  aircraft,  causes  only  small 
errors  for  fast  aircraft  flying  below  1  km. 


laXaQPUCTIQM 

Our  goal  Is  to  snow  that  an  aircraft  can  be  lo¬ 
cated  in  the  horizontal  plane  using  data  from 
jnly  two  acoustic  sensors.  Each  sensor  is  a  mi- 
cropnone  array  whose  outputs  are  of  the  form 
’sound  is  coming  from  azimuth  (horizontal  direc¬ 
tion)  a  at  time  t."  The  two-sensor  method 
described  below  contains  ideas  basic  to  any  DSN 
tracking  low  flying  aircraft  with  acoustic  array 
3ensors . 


locating  m  AiaaafT 

Suppose  we  have  two  sensors,  3,  and  S,,  located 
10  km  apart,  and  both  at  the  same  height,  which 
is  zero  in  our  coordinate  system.  Suppose  a 
piane  is  flying  around  somewnere,  and  the  az¬ 
imuth  vs.  time  curves  for  the  two  sensors  are 
as  in  Figure  1. 

First,  we  want  to  find  the  plane  at  time  t,  as¬ 
suming  (with  no  particular  reason)  that  the 
plane  is  at  height  z  s  3.  Consider  Just  the  S, 
data.  The  sound  emitted  by  the  plane  at  time  t 
must  arrive  at  3.  at  seme  time  t1  >  t.  Let 
be  the  azimuth  reading  measured  for  time  t,. 
Then  at  time  t  the  plane  is  distance 


R1  =  (t1  -  t)' (speed  of  30und) 
from  S.J,  or  at  horizontal  distance 


!  2  2 
r,  s  \!R,  -  z 

from  S1 .  Given  t^,  we  have  found  d1  and  r1 , 
which  specify  a  point  in  the  horizontal  plane. 
The  set  of  all  pairs  (r1t  given  by  all 
C1  >  t  generates  a  curve  of  possible  positions 
in  the  horizontal  plane  for  the  airplane  at  time 
t.  We  call  this  latter  curve  the  possible  posi¬ 
tion  curve  for  S.,  t,  and  z*0.  Figure  2  shows 
such  curves  for  t  s  -30,  -26,  -22,  -18,  -14,  and 
-10  seconds,  z  =  0,  and  both  5^  and  Sj.  Note 
that  the  possible  position  curve  for  a  sensor 
originates  at  that  sensor,  which  is  the  point  on 
the  curve  corresponding  to  t1  *  t,  unless  the 
available  measured  azimuth  curve  begins  after 
time  t. 

At  time  t  the  plane  must  be  at  the  Intersection 
of  the  S1  and  3,  curves  for  given  t  and  z,  as¬ 
suming  that  the  plane  is  actually  at  height  z. 
Thus  for  each  t  we  get  a  plane  location.  Con¬ 
necting  these  locations  with  straight  lines 
gives  the  computed  plane  path  displayed  in  Fig¬ 
ure  2.  The  computed  plane  path  in  Figure  2  is 
Just  about  equal  to  the  actual  plane  path  so  the 
actual  path  Is  not  displayed.  The  plane  is  ac¬ 
tually  flying  a  straight  lino  at  0.9  Mach  and 
height  0.  t  km.  The  difference  between  the  as¬ 
sumed  height  z  *  0  and  actual  height  3.1  km 
results  in  negligible  error. 

The  possible  position  curves  for  t  =  -22  and  -18 
have  multiple  intersections:  the  extra  intersec¬ 
tions  are  labelled  G  denoting  ghost.  A  crude 
automatic  algorithm  has  been  used  in  all  our  ex¬ 
amples  to  distinguish  real  position  measurements 
from  ghosts  3hd  trace  the  computed  path. 


AZIMUTH  ERRORS 

We  now  consider  the  effects  of  errors  in  azimuth 
measurement.  Suppose  each  such  measurement  A  is 
replaced  by  a  range  of  measurements  (e-e,  d*e) 
wnere  e  is  the  maximum  possible  error  in  A. 
Then  each  possible  position  curve  is  replaced  by 
a  region  shaped  like  a  highway.  The  boundaries 
of  this  region  are  Just  the  possible  position 
curve  rotated  by  »e  and  -e  degrees  around  the 
sensor.  Figure  3  shows  these  possible  position 
regions  for  t  =  -30,  — i 9 ,  and  -6  with  z  *  3  and 
e  s  5  degrees.  Also  shown  are  the  computed 
path,  plus  the  ghosts  for  t  =  -i3. 

For  particular  t  and  z  the  intersections  of  the 
S,  and  3,  possible  position  regions  determine  an 
intersection  region  where  the  plane  will  pe 
found.  This  region  generally  contains  all  plane 
locations  and  ghosts  found  ir.  the  previous  sec¬ 
tion  for  given  t  and  z.  Basically,  ghost3  are 
the  result  of  nearly  tangentially  intersecting 
possible  position  curves,  and  are  associat'd 
with  an  elongated  intersection  region  containing 
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the  true  postion  plus  ghosts.  Noise  in  the  az- 
iauth  readings  can  cause  gnosts  anywhere  in  the 
intersection  region. 

The  intersection  angle  of  the  S1  and  S2  possible 
position  curves  indicates  whether  the  Intersec¬ 
tion  region  will  be  elongated.  For  mathematical 
reasons  we  consider  this  angle  to  have  a  range 
from  3  to  360  degrees,  with  180  degrees  being 
the  usual  angle  for  tangential  intersection. 
Angles  between  180  -  2e  and  1 80  »  2e  degrees 

will  generally  result  in  elongated  intersection 
regions,  whereas  angles  near  90  or  270  degrees 
will  generally  result  in  saall  rectangular  in¬ 
tersection  regions.  The  intersection  angle  can 
be  computed  as  follows.  From  the  point  at  which 
the  plane  is  located  draw  three  velocity  vectors 
' see  Figure  A ) : 

v  The  plane's  apparent  horizontal  velocity. 

v.  Points  at  3.  and  has  magnitude 
' speed  of  sound)/cos  91  where  81  is  the 
elevation  angle  of  the  plane  the  given 

point  '.depends  on  z). 

v.  Eimilar  to  v1  but  for  S2  and  82. 

Then  the  direction  of  the  5,  possible  position 
curve  going  through  the  point  is  w1  =  v  -  v1  and 

of  tne  3.  curve  is  w,  =  v  -  •/,. 

a  2  2 

As  an  aside  we  indicate  why  the  procedure  Just 
given  works.  At  time  t  the  plane  is  at  the 
given  position  emitting  sound  received  by  S,  at 
time  t,  and  angle  At  time  t+dt  the  plane 

na3  moved  by  an  incremental  position  dt’v  emit¬ 
ting  3ound  received  by  3^  at  time  t1*dt1  and  an¬ 
gle  Oj-rdd..  let  w  be  a  vector  directed  along 
the  possible  position  curve  for  S1  such  that 
moving  an  incremental  position  dt-w1  from  the 
original  position  will  also  correspond  to  the 
3aoe  change  (dt,,  dd.)  in  the  azimuth  curve  for 
3..  Consider  the  local  rectilinear  coordinate 
3ysteo  i_,  1.,  i^  at  the  aircraft  location  asso¬ 
ciated  with®  global  spherical  coordinates  with 
origin  3,,  where  i^  is  the  unit  vector  directed 
from  3,  '  to  the  given  position,  i^  is  the  unit 
vector  in  the  direction  of  increasing  d,  and  ia 
is  the  unit  vector  in  the  direction  of  increas¬ 
ing  9.  Then  if  p,  is  the  horizontal  distance 
between  3,  and  the  plane  location, 

dd,  s  dt  (v'i^)  /  r1 


dt ,  =  dt  *  dt  (v'i  /  speed  of  sound) 

dt,  »  dt  (w ,‘i  /  speed  of  sound). 

1  ■  P 

These  last  equations  imply 


i 

*  i_  =  speed  of  sound  *  v*T 


Ih*  equations  just  given  imply  w1  =  v  -  v,  -where 
Vj  is  chosen  so  that 

speed  of  sound  *  -  v^'i 

0  =  vv 

While  v  is  the  apparent  horizontal  velocity  for 
the  computed  path,  and  not  the  plane's  actual 
horizontal  velocity,  we  will  ignore  the  distinc¬ 
tion  here  because  the  difference  between  these 
velocities  will  be  small  as  long  as  the  error  in 
position  due  to  misestimated  height  z  is  small. 
(If  the  position  were  accurate  both  these  velo¬ 
cities  would  be  the  same,  because  they  give  the 
same  rates  of  azimuth  change  for  S,  and  S2). 
Note,  however,  that  an  actual  vertical  component 
to  the  plane  velocity  is  equivalent  to  a  radial 
horizontal  component,  because  it  projects  on  i_, 
but  this  equivalent  horizontal  component  is  dif¬ 
ferent  for  each  sensor,  and  cannot  be  expressed 
by  changing  the  apparent  horizontal  velocity  of 
the  plane.  So  our  calculations  are  not  exact 
unless  the  plane  actually  has  zero  vertical 
velocity,  or  unless  the  assumed  height  z  =  0. 

For  given  z  and  v,  curves  of  constant  intersec¬ 
tion  angle  can  be  plotted  in  the  horizontal 
plane.  These  constant  intersection  angle  curves 
have  sensor  as  end  points,  and  are  show  in  Fig¬ 
ure  5  for  z  *  0  and  the  actual  plane  velocity  v. 
'We  call  the  region  in  which  the  intersection  an¬ 
gle  lies  between  180  -  2  a  and  180  *  2e  degrees 
the  region  of  linear  ambiguity,  because  in  that 
region  plane  position  can  only  be  localized  in 
one  dimension. 

Comparison  of  Figures  3  and  5  suggests  that  the 
intersection  angle  is  not  a  very  precise  measure 
of  how  well  a  plane  is  localized.  The  region  of 
linear  ambiguity  is  nonetheless  a  useful 
descriptive  concept. 

Figure  6  shows  the  computed  paths  of  the  four 
corners  of  the  intersection  regions  for  times 
including  those  of  Figure  2.  The  Intersection 
regions  tend  to  lie  between  the  outermost  of 
these  four  paths.  One  clearly  sees  the  changes 
in  length  of  the  intersection  regions  as  inter¬ 
section  angle  changes.  Near  the  region  of 
linear  ambiguity,  the  corners  of  the  intersec¬ 
tion  regions  exchange  positions  relative  to  each 
other,  with  their  computed  paths  crossing  over 
each  other. 


ERRORS  I2L  ASSUMES  HE13HT 

Figure  7  displays  the  computed  plane  paths  for 
three  different  assumed  heights,  z  -  2.  1,  and  2 
km.  Not  much  change  is  observed  as  z  changes, 
except  ir.  the  region  of  linear  ambiguity. 

Figure  3  is  just  Figure  2  but  with  assumed 
height  c  =  2  <n  instead  of  z  s  3.  Note  that  not 
only  have  the  ghosts  disappeared,  but  there  is 
also  no  intersection  at  all  of  the  possible  oo- 
3ition  curves  far  t  *  -13  seconds.  Errors  in 


azimuth  can  also  eliminate  intersections. 

Figure  9  displays  possible  position  curves  for 
t  =  0  and  z  =  0 ,  1 ,  and  2 .  Changes  in  z  only 
cause  changes  in  the  radial  distance  to  the  sen¬ 
sor,  and  therefore  change  the  possible  position 
curve  significantly  only  when  that  curve  is  per¬ 
pendicular  to  the  radial  direction  from  the  sen¬ 
sor.  Maximum  perpendicularity  is  achieved  by 
fast  (nearly  Mach  1.0)  planes  flying  nearly  to¬ 
ward  the  sensor.  But  if  such  a  plane  is  between 
the  two  sensors,  it  will  have  a  nearly  radial 
possible  position  curve  for  the  other  sensor, 
and  consequently  the  possible  position  curve  for 
the  other  sensor  will  not  change  much  with 
changing  z.  So  changing  z  in  Figure  9  moves  the 
intersection  point  along  one  of  the  two  inter¬ 
secting  possible  position  curves. 

Changing  z  induces  a  radial  translation  in  a 
possible  position  curve  very  similar  to  the  ra¬ 
dial  translation  induced  by  changing  t.  Thus  in 
Figure  9,  changing  z  moves  the  intersection 
point  along  the  actual  path.  As  a  result, 
misestimating  z  does  not  misestimate  the  plane 
path,  but  only  the  times  at  which  the  plane  is 
at  various  points  on  that  path. 

For  planes  flying  between  the  sensors  changing  z 
generally  results  in  a  position  error  parallel 
to  the  plane  path. 


Time  series  azimuth  measurements  from  Just  two 
sensors,  plus  an  estimate  of  plane  height,  can 
be  used  to  locate  a  plane  within  a  box,  the  in¬ 
tersection  region  of  two  possible  position  re¬ 
gions.  For  approximately  half  of  a  typical 
plane  path  this  intersection  region  is  a  nicely 
behaved  rectangular  box,  but  elsewhere  the  re¬ 
gion  is  elongated  and  may  become  U  shaped.  The 
major  exception  to  this  rule  occurs  when  the 
plane  flies  along  the  straight  line  through  the 
sensors,  passing  very  near  both  sensors. 

For  two  sensors  10  km  apart  and  planes  flying  no 
higher  than  1  <m,  the  affect  of  misestimating 
the  plane  height  is  usually  minimal. 


Fig.  1.  Azimuth  angles  vs.  time  measured  by 
3ensors  3t_  and  3c  for  3one  aircraft 
to  be  located. 
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3.  Possible  position  for  t  from  -20  tc  - 
sec.  There  i3  one  curve  for  each  ser. 
3^  and  3o,  and  intersections  of  these 
curves  are  either  pcir.t3  cn  computed 
plane  path  'nearly  straight  line;  :r 
ghosts  marked  .  Assumed  plane  hei 
is  :  *  ),  actual  height  i3  0.1  ,-ut,  cl 
velocity  i3  Mach  0.?. 
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Possible  position  regions  and  intersec¬ 
tion  regions  for  tines  -30,  -18,  and  -d 
sec.  iomputad  plane  path  (straight 
line;  and  gh03ts  (narked  Z)  for  these 
tines  are  also  plotted.  Possible  posi¬ 
tion  curves  with  assumed  errors  e  of 
*5°  and  -j3.  z  »  3  as  for  Pig.  2. 


v  is  plane'3  apparent  horioontal  velocity. 

i3  directed  towards  3t_,  v?  towards 
3p._  If  assumed  height  z  »  Z,  magnitude 
of  v.  and  v,  is  speed  of  sound.  wj_  is 
directed  along  3i  possible  position 
curve,  and  w-  along  3,  possible  position 
curve. 


Pig.  5.  iurves  of  constant  intersection  angle, 
for  angles  from  50°  to  21C°  in  13° 
increments.  Plane's  actual  velocity 
is  used  with  assumed  height  z  =  Z. 
Computed  plane  path  and  ghosts  are 
displayed.  Pegion  of  linear  ambiguity 
lies  between  173°  and  190°  curves. 
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?ig.  *5.  "onputed  paths  of  :cmers  of  intersect!: 

regions  for  errors  e  of  and  -:3  in 
azimuth  angle,  assumed  height  z  *  ", 
and  times  from  -5C  to  +50  sec  in  —sec 
increments.  Intersection  regions  gener¬ 
al!/  lie  betveen  outermost  paths.  P.egi. 
of  linear  ambiguity  i3  also  sr.c 
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Abstract 

Data  from  a  5  meter  array  of  microphones 
has  been  collected  and  analysed  in  the  band  20- 
200  Hi  to  find  the  accuracy  to  which  the  azimuth 
of  low-flying  subsonic  aircraft  can  be  tracked 
and  t <f  measure  signal  levels  as  a  function  of 
range.  Flight  tath3  were  radar  tracked  and 
recorded  on  digital  tape.  Propagation  character¬ 
istics  were  calculated  from  measured  meteorological 
data  and  vere  used  to  help  interpret  observed 
signal  level  variations.  The  aircraft  included 
helicopter,  prop-iriven  and  jet  aircraft  which 
flew  30  km  reversed  linear  tracks  at  altitudes 
of  *0  m,  ’00  n  and  1  kn.  Azimuth  was  measured 
by  applying  maximum- likelihood  frequency- 
wavenumber  analysis  to  the  digitized  iata  from 
the  fractional  wavelength  array  of  microphones. 
Measured  azimuths,  after  the  appropriate  moving 
source  corrections,  were  found  to  be  accurate  to 
within  a  few  iegrees  at  ranges  veil  beyond  5  km. 
Filtering  and  array  gain  allowed  us  to  obtain 
coed  azimuth  estimates  for  aircraft  signals  at 
least  10  12  below  the  background  noise  level. 

Introduction 

During  June  19?“,  on  experiment  was  conducted 
at  Fort  Huachuca,  Arizona,  for  the  purpose  of 
collecting  acoustic  and  seismic  array  iata  for 
low-flying  aircraft  travelling  at  subsonic 
velocities.  This  experiment  differed  from  all 
previous  work  known  to  us  in  that  the  exact 
position  of  the  aircraft  a3  a  function  of  time 
was  obtained  by  recording  radar  tracking  informa¬ 
tion  or.  digital  tape. 

The  radar  tracking  and  recording  service 
was  provided  by  the  United  States  Army  Electronic 
Proving  Ground  USAZPG),  Fort  Huachuca,  Aritona. 

In  addition,  U3AZFG  arranged  for  aircraft, 
communication  facilities,  power,  balloon  flights 
to  gather  meteorological  data,  surveillance 
radar,  and  general  physical  support.  The  acoustic 
and  seismic  measurement  and  recording  equipment 
ml  the  necessary  personnel  to  3et  up  and  operate 
it  were  provided  by  the  United  States  Army  Ports 
of  Engineers  Vaterwavs  Experiment  Station  CVESi. 

VE3  also  provided  the  ar.alog-to— digital  iata 
inversion  services  at  their  facility  in  Vicksburg, 
Mississippi. 

In  addition  to  source  location  requirements, 
the  experiment  va3  designed  to  include  as  complete 
i  set  of  the  various  aircraft  source  types  as 
possible  ir.d  known  atmospheric  acoustic  transmis¬ 
sion  characteristics.  There  are  essentially 
four  different  aircraft  acoustic  scarce  types, 
and  me  of  each  .lass  was  employed  in  the  experi¬ 


ment.  The  aircraft  were:  an  A-f  jet,  a  Mohawk 
0V-1D  twin-engined  turboprop ,  a  Seechcraft  U— i 
tvin  reciprocating-type  prop,  and  a  UH-1  nelicopte 
To  adequately  cover  the  possibility  of  atmospheric 
propagation  problems,  the  experiment  was  designed 
such  that  each  aircraft  flew  reversed  tracks  30 
km  in  length  at  altitudes  of  73  n,  3C0  n,  and  1 
km  above  the  recording  site,  and  a  circular 
track  of  radius  L  to  5  km  about  the  recording 
site.  The  tracks  were  repeated  at  various  times 
when  predicted  meteorological  conditions  might 
have  caused  significant  differences  in  acoustic 
propagation  characteristics.  To  firmly  establisn 
acoustic  propagation  characteristics  at  those 
times,  two  meteorological  balloon  flights  were 
conducted  at  the  expected  extreme  acoustic 
conditions,  early  morning  and  mid-afternoon . 

The  aircraft  flight  paths  were  radar  tracked  and 
controlled  and  recorded  on  digital  tape.  Synchro¬ 
nised  acoustic  iata  were  analog  recorded  at  the 
sensor  3ite. 

The  acoustic-seismic  sensors  and  reooriir.g 
van  setup  is  shown  in  Fig.  1,  ar.i  the  array 
setup  is  shown  in  Fig.  2.  The  array  wa3  iesigr.ed 
to  provide  unalia3ed  yet  complete  and  resolvable 
wavenumber  coverage  appropriate  to  the  frequency 
range  5  to  3C0  Hz  for  phase  velocities  of  the 
speed  of  sound  and  greater.  Referring  to  Fig. 

D,  the  instruments  beginning  with  G3  were  Genera* 
Radio  type  15oG-?5  microphones ,  2K  were  2  •«  K 
type  *1 bo  microphones,  DR  were  723  type  tlc- 
three-conponent  gradient  microphones ,  ERT  was  a 
General  Radio  type  19bi-ifc01  electret  miortptcr.e, 
and  5  were  three— component  Mark  Products  L— 
geophones.  The  seismometers  were  tlacec  approxi¬ 
mately  15  cm  below  ground  Level  or.  cememt  tiers. 
ERT  was  placed  at  3  m  above  ground  level,  ind 
all  of  the  ether  microphones  at  1  m  above  ground 
level.  A  GR  microphone  was  placed  approximate*;- 
l.'S  km  GW  of  the  array  to  obtain  Large  range 
signal  coherence  information. 

During  field  operatior.3 ,  aircraft  acoustic 
ar.i  seismic  iata  were  recorded  :n  analog  tape. 

An  additional  amount  of  time  was  spent  recording 
ambient  noise  conditior.3  luring  various  tarts  of 
the  iav  and  no  ght  and  conducting  instrument 
calibrations .  Intervals  of  the  iata  representing 
each  aircraft  type  were  iigititei  at  -ICO  Hz. 
Figures  3  threugn  t  3hcv  examples  of  digitized 
iata  for  each  aircraft  type  :n  the  various 
instruments .  Ve  note  that,  as  expected,  blade- 
rate  harmonic-type  signals  dominate  for  prop- 
driver.  aircraft  and  the  overall  3igr.al  coherence 
13  high  for  all  aircraft  types .  Ve  also  observe 
good  S.'N  ratios  in  the  iata.  The  G7-LD  iata 
were  taaen  during  bad  surface  -wind  conditions 
'rusting  to  35  knots'.  However,  the  acoustic 
noise  from  the  wind  is  well  out  of  the  signal 
band  and  the  ir.bar.i  3  7:  is  3till  Viite  sign .  Ve 


Analysis  ana  Evaluation  : f  Experimental  lata 

Preliminary  results  based  on  a  review  and 
analysis  of  a  part  of  the  experimental  data  are 
summarised  in  the  following  sections,  In  order 
to  interpret  the  results  properly  and  to  under¬ 
stand  the  potential  'use  of  acoustic  sensors,  one 
must  he  aware  of  certain  significant  moving- 
source  and  atmospheric-propagation  effects. 

These  well  known  effects  as  they  apply  to  our 
Particular  situation  are  reviewed  in  the  sections 
preceding  the  analysis  and  evaluation. 

Moving-oource  affects 

Tonsider  that  at  time  t  a  sound  source, 
moving  at  constant  velocity  v,  is  at  a  distance 
vt  from  tr.e  closest  point  of  approach  (opal  to 
an  observation  point  3.  let  the  distance  between 
3  and  opa  he  r  so  that  the  distance  of  the 
source  from  3  at  time  t  is 


"igures  3  and  3  show  the  behavior  of  these 
functions  of  M  *  3.75  and  M  *  3.12,  respectively, 
which  are  the  maocimum  and  nut  urn  urn  values  used  ir. 
the  experiment.  Ve  note  the  following  crooerties 
of  these  functions  which  apply  to  a  greater 
extent  for  larger  Mach  numbers  and  a  lesser 
extent  for  smaller  Mach  numbers.  As  shewn  in 
the  figures,  the  acoustic  position  is  ielayed 
relative  to  the  actual  position  in  proportion  to 
the  Mach  numbers.  At  large  equal  distances,  the 
pressure  on  the  incoming  path  is  greater  than 
the  pressure  on  the  outbound  path  by  a  factor  of 
(1  +  -  M)  .  ?or  example,  a  source  traveling 

half  the  speed  of  sound  radiates  four  times  the 
power  forward  as  backward.  Additionally,  the 
peak  pressure  comes  before  the  vehicle  is  at 
cpa.  Finally,  we  note  that  large  changes  in 
azimuth,  excess  pressure,  and  doppier  '.t/T 
take  place  primarily  in  the  region  when  the 
vehicle  is  within  about  tr/c  of  opa. 

Meteorological  effects 


The  sound  arriving  at  0  came  from  the  source 
when  it  was  at  3ome  earlier  position,  say  a 
distance  of  R  from  3.  The  time  required  for 
sound  to  travel  ?.  .i.e.,  3/c,  where  c  equals  the 
speed  of  sound;  win  equal  the  time  the  source 
took  to  move  from  3  to  3.  let  that  be  a  distance 
1.  Since  the  source  moves  at  v,  then  we  have 
-/v  =  3,'c  so  that  L  *  MR  where  M,  the  Mach 
number,  is  iefined  at  v/c.  The  geometry  of  this 
situation  is  3hown  in  Fig.  From  the  figure 
we  have 


and  30,  for  subsonic  v,  the  acoustic  range  is 
giver,  by 

t !  »  [-Mvr.  +  /(l  -  M^lr2  ■*•  v2t"]/(l  -  M2;  . 

Tsir.g  this  relationship,  the  following  relation- 
snips  ire  found-;  the  true  azimuth, 

v  t ;  »  tan”ir-vt/ri , 


the  acoustic  azimuth, 

sit  *  tan”" [’MR  -  vt', /rj. 


the  perceived  frequency  when  the  source  is 
smithing  a  frequency  f^, 

r. -.  »  r  '[i  -  m  sim'  o' ;, 

ohs  acoustic  range  rate, 

*  -v>  f  /f  /  3in;  0  5 , 


its  acoustic  azimuth  rate, 

: 1 1  «  -rv\  f  f  '  ’5“ 
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In  support  of  the  experiment ,  two  balloons 
were  launched  to  gather  sound  speed,  wind  velocity, 
and  relative  humidity  data.  Balloon  flights 
vere  scheduled  during  the  expected  most  extreme 
acoustic  conditions.  The  purpose  was  to  obtain 
an  appropriate  data  base  for  determining  the 
importance  of  sound  refraction  and  attenuation 
over  the  observed  ietection  range  and  signal 
variations.  The  first  balloon  was  launched 
during  the  mid-afternoon  (approximately  3  p.a. 
local  time )  when  the  normal  lapse  rate  would 
have  a  high  positive  temperature  gradient  approach¬ 
ing  the  ground  surface.  Turing  3uch  conditions, 
the  3ouad  velocity  profile  is  monotcnioaily 
decreasing  upwards  causing  sound  energy  to  be 
continuously  refracted  upward  at  every  level  of 
the  troposphere.  Under  such  ocndifior.3,  detection 
ranges  should  be  less  than  average.  Additionally, 
during  this  period  small-  and  large-scale  thermal 
convection  i3  expected  to  have  non-r.egiigible 
acoustic  effects  by  increasing  the  background 
acoustic  noise  level  'wind  noise'  and  introducing 
asymetry  in  propagation  characteristics.  This 
phenomenon,  due  to  wind  velocity  gradients  will 
alter  the  effective  sound  velocity  profile  as  a 
function  of  propagation  iirecticr..  Figure  13 
shows  the  meteorological  iata  collected  during 
this  time  span.  The  wind  direction  is  not  shown 


although  that  data  was  also  obtained  from  the 


balloon  tracking  iata.  The  sound  speed  is  that 
calculated  directly  from  the  temperature  iata. 
Figure  11  snows  the  east-west  geometrical  propaga¬ 
tion  characteristics  'based  cn  ray  tracing"'  for 
sources  at  IOC ,  ?00 ,  and  ;C30  ft  above  ground 
level.  Only  the  critioal  distances  are  indicated 
and  not  the  ray  paths.  The  calculation  predicts 
shadov  tones  at  2.1  km  vast  and  1.3  rt  east  fer 
the  source  at  200  ft  altitude;  i.e..  a.l  rays 
that  3tart  cut  more  horizontal  :r  upward  tna.t 
the  critioal  rays  are  turned  upward  refer* 
reaching  the  ground.  Similar  statements  are 
true  for  the  sources  at  the  higher  ii.  circle  with 


i.taicv  tone  distances  r 
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tuctea  in  an  easterly  direction.  jlven  manges 
in  the  meteorological  conditions,  or  changes  in 
topography,  sum  energy'  nignt  be  detected  at 
large  ranges.  It  is  noted  that  detection  range 
.assumed  proportional  to  the  shadow- zone  range, 
is  directly  proportional  to  the  source  altitude 
for  these  meteorological  conditions. . 

The  second  balloon  was  launched  during  the 
early  morning  . approximately  2  a. a).  At  this 
time,  the  normal  lapse  rate  3hould  be  underlain 
by  a  high  negative  temperature  gradient  approaching 
the  ground  surface.  Under  these  conditions,  the 
3ound-speed  profile  has  a  zone  of  low  values 
near  the  surface.  Energy  entering  that  zone 
will  be  refracted  toward  the  ground,  increasing 
the  effective  ietecticn  range.  During  periods 
when  such  velocity  conditions  persist,  better 
than  average  ietection  ranges  should  be  encountered. 
Upper  level  'winds  during  this  time  period  may 
also  have  severe  propagation  path  effects. 

However,  there  is  tot  much  3maii-scaie  lower 
level  turbulence  at  that  time  and  consequently 
low  background  noise  levels  should  be  encountered. 
Figure  11  shows  the  balloon  data  collected 
luring  this  early  morning  flight,  and  Fig.  12 
shows  the  ray-tracing  results  based  on  these 
iata.  Just  opposite  to  the  mid-afterr.oor.  results, 
the  lowest  source  nas  the  largest  ietection 
range.  Again,  winds  produce  asymmetry  and  some 
ducting  occurs,  but  overall  much  larger  propagation 
distances  can  be  expected  during  these  conditions. 

In  summary  of  :Se  Theoretical  ray-tracing 
results ,  we  expected  geometrical  detection 
ranges  of  at  least  rij  km  for  all  the  sources 
during  good  acoustic  periods  and  of  at  least  a 
few  kilometers  during  unfavorable  acoustic 
conditions.  Ambient  noise  conditions  iue  to 
surface  -winds,  rain,  etc.,  will  of  course  make 
detection  more  difficult  but  should  not  signifi¬ 
cantly  change  the  above  effects  for  systems  of 
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spreading  for  a  homogeneous  atmosphere.  The 
vertical  lines  show  the  geometrical  3r.adow 
boundaries  determined  from  the  meteorological 
iata.  Finally,  the  peak  noise  level  is  given  by 
the  aorizontai  lines. 

The  data  presented  in  the  figures  can  be 
summarized  as  follows:  in  all  zases,  with  only 
25  iB  of  effective  dynamic  range,  the  aircraft 
•were  acoustically  detectable  out  to  at  least  13 
km,  although  not  always  on  both  the  inbound  and 
outbound  paths.  Amplitudes  die  off  at  about  the 
geometrical  rate,  appropriate  to  a  homogeneous 
atmosphere,  up  to  the  calculated  geometrical 
3hadcw  boundary  for  the  actual  atmospheric 
conditions.  In  the  diffracted  region,  signal 
levels  are  diminished  as  expected,  though  still 
at  detectable  levels.  These  observations  are 
consistent  with  the  predictions  based  or.  the 
measured  meteorological  conations.  Finally,  we 
note  that  even  though  the  ambient  or  background 
r.oise  level  is  often  considerably  greater  than 
the  signal  level,  the  ietection  level  is  still 
hign  since  the  noise  ar.d  signal  energy  are  well 
separated  in  frequency. 

Scectral  Observations 

Figures  19  through  25  show  spectra  computed 
at  1-sec  intervals  for  each  aircraft  type  luring 
selected  part3  of  their  flight  paths.  Figures 
19  to  22  show  the  UH-i  helicopter  beginning  at  '■» 
km  inbound,  -  km  outbound,  and  circling  at  a  :- 
km  distance,  respectively.  The  velocity  cf  the 
aircraft  was  about  -0  a/ sec,  therefore  each 
segment  represents  a  change  in  position  of  10  m. 
The  basic  blade-rate  frequency  for  the  UH-i  was 
11  He.  Figures  22  and  22  show  similar  spectra 
for  the  U-o,  and  Figs.  21  and  25  show  spectra 
for  the  A-T  and  07-13,  respectively.  Fundamental 
blade-rate  frequency  for  the  U-3  was  iSC  Hz  and 
shaft  rate  60  Hz.  The  OV-13  blade-rate  frequency 
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array  configuration  3hovn  in  Fig.  27.  The 
purpose  was  to  establish  the  feasibility  of 
taking  useful  azimuth  measurements  us i ng  a  small 
acoustic  array.  It  is  important  to  distinguish 
between  accuracy  and  precision  in  acoustics 
since,  while  an  acoustic  wavefront  crossing  an 
array  may  be  such  that  a  very  precise  measurement 
:f  its  direction  and  velocity  can  be  made,  the 
direction  may  not  very  accurately  point  to  the 
source  from  which  the  sound  originated.  Inaccu¬ 
racies  of  considerable  magnitude  may  be  present, 
due  to  crosswinds,  topographic  reflections, 
horizontal  temperature  gradients,  and  in  general, 
the  complicated  acoustic  nature  of  the  troposphere. 
Figure  28  shows  the  impulse  response  of  the 
array  contoured  in  1-dB  intervals  in  wavenumber 
space.  The  circle  labeled  C  is  the  wavenumber 
corresponding  to  the  speed  of  sound,  and  any 
propagating  3ound  fields  crossing  the  array  will 
have  their  power  within  this  circle.  Figures  29 
and  2C  show  high-resolution  and  conventional 
wavenumber  spectra  for  a  'JH-1  at  3  2m  acoustic 
range.  A  measure  of  precision  i3  the  width  of 
the  1—13  contour  which  is  a  function  primarily 
of  the  array  configuration  and  chosen  frequency. 

In  this  case,  the  width  is  about  12°.  The 
accuracy  is  lead  on.  The  conventional  spectrum, 
although  less  precise,  is  also  lead  on  in  locating 
the  true  iirection  of  the  source.  Figures  21, 

22,  and  23  3hov  the  results  for,  respectively, 
a  7-8  at  5  km,  an  A-7  at  3  km,  and  an  SV-1D  at 
1.1  km.  In  each  case,  the  accuracy  is  well 
within  the  precision  and  on  the  order  of  at  most 
a  few  degrees.  Figure  31*  shows  the  observed 
acoustic  azimuth  obtained  from  the  acoustic 
array  processing  as  a  function  of  the  acoustic 
range  obtained  from  the  radar  iata  for  a  complete 
•a-7  track.  The  known  acoustic  azimuth,  again 
from  the  radar  iata,  is  3hovn  by  the  solid  line. 
'While  it  13  premature  to  Judge  that  acoustic 
iirection  will  generally  be  determinable  to  the 
accuracy  found  here,  these  results  indicate  that 
such  is  oossible. 


Fig.  1.  Acoustic-seismic  sensor  and  recording 
van  setup. 
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Fig.  2.  Acoustic-seismic  array  setup. 
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^li/rracr-High-level  protocol!  (HLP’s)  ire  the  high-level  languages  of 
distributed  systems.  In  a  resource-sharing  network,  HLP’s  link  processes 
working  on  a  common  application.  The  design  of  an  HLP  is  decom¬ 
posed  into  three  components:  language,  coding,  and  transport.  The 
language  expresses  the  commands  and  data  passed  between  processes. 
It  is  designed  to  provide  standardization  and  device  independence,  in 
order  to  use  a  small  number  of  HLP's  to  address  a  range  of  applications 
implemented  on  a  variety  of  computer  systems.  Coding  converts  the 
language  into  digital  messages.  Finally,  a  transport  system  is  used  to 
transmit  the  messages  from  one  process  to  another -experience  with 
HLP's  has  shown  that  different  HLP’s  require  different  transport  be¬ 
haviors.  This  paper  describes  some  examples  of  HLP’s  (ARPA  network 
voice  and  graphics  protocols),  and  argues  that  modern  techniques  for 
expressing  structure  and  control  in  programming  languages  should  be 
applied  to  analogous  problems  in  communication  among  application 
processes  in  a  network. 

r.  Introduction 

IGH-LEVEL  PROTOCOLS  (HLP’s)  are  the  high-level 
languages  (HLL’s)  of  a  distributed  computing  environ¬ 
ment.  They  are  the  means  by  which  a  communications 
network  and  processing  resources  in  the  network  are  harnessed 
in  an  orderly  way  to  accomplish  some  task.  Using  different 
resources  in  an  application  on  a  single  computer  is  by  now  nat¬ 
ural,  aided  by  operating  systems,  high-level  languages  memory 
management,  and  other  well-established  tools.  But  a  resource¬ 
sharing  computer  network  offers  to  an  application  the  resources 
available  throughout  the  network  [  18] ,  (291  -  to  harness  these 
distributed  resources  requires  the  collaboration  of  several  com¬ 
puters  and  many  computational  processes,  and,  therefore,  a 
set  of  conventions  for  harmonious  communication  that  ex¬ 
tends  throughout  the  distributed  environment.  HLP’s  are 
these  conventions. 

The  term  “high-level  protocol’’  attempts  to  distinguish 
protocols  designed  to  control  the  computing  processes  involved 
in  an  application  from  “low-level  protocols"  designed  primarily 
to  control  communication  processes.  Two  computer  processes 
require  communications  in  order  to  transmit  collections  of  bits 
from  one  to  the  other.  But  it  is  the  HLP  that  assigns  an  inter¬ 
pretation  to  the  bits,  and  thus  allows  each  process  to  control 
the  other.  An  analogy  may  be  helpful.  To  call  a  department 
store  on  the  telephone  is  simply  to  establish  communication. 
But  when  the  caller  says,  “Please  send  me  two  pillowcases, 
catalog  number  X802,  and  bill  my  account,”  he  is  using  an 
HLP  that  controls  ordering  goods.  Thus  the  HLP  is  imple¬ 
mented  “on  top  of’  the  low-level  communication  protocols. 

Manuscript  received  February  2S,  1978;  revised  June  9,  1978,  This 
work  was  supported  by  the  Advanced  Research  Projects  Agency  of  the 
Department  of  Defense  under  Contract  DAHC- 1 5-72-C0308,  ARPA 
Order  2223,  and  under  Contract  F44620-73-C-0074,  monitored  by  the 
Air  Force  Office  of  Scientific  Research. 

R.  F.  Sprouil  is  with  the  Computer  Science  Department,  Carnegie- 
Meilon  University,  Pittsburgh,  PA  15213. 

D.  Cohen  is  with  USC/Information  Sciences  Institute,  Marina  del  Rey, 
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Of  course,  HLP’s  that  control  computer  processes  need  precise 
definitions  of  commands  and  responses,  and  cannot  rely  on 
human  intelligence  to  provide  an  interpretation.  (The  paper 
by  Pouzin  and  Zimmermans  [26]  in  this  issue  introduces  the 
notion  of  “protocol"  and  the  concept  of  “layering”  by  which 
an  HLP  is  implemented  in  terms  of  lower  level  communica¬ 
tions  protocols.  Their  term  “application  control”  is  equivalent 
to  our  “high-level  protocol.”) 

The  services  available  in  a  resource-sharing  network  are 
sufficiently  diverse  that  several  different  HLP’s  need  to  be  de¬ 
signed.  For  example,  HLP’s  have  been  developed  to  control 
job  entry,  interactive  terminals,  file  systems,  graphics  displays, 
resource  allocation,  and  voice  transmission  in  distributed  en¬ 
vironments.  But  why  are  HLP’s  required  to  control  these 
services?  Is  not  the  communication  provided  by  a  network 
sufficient?  The  need  can  be  demonstrated  by  an  example.  At 
first,  using  a  file  storage  service  seems  to  be  a  simple  matter  of 
sending  a  file  to  the  service  as  a  serial  stream  of  bits,  and  subse¬ 
quently  retrieving  the  same  stream.  But  additional  attributes 
of  the  file  must  also  be  saved  and  retrieved:  a  file  name,  some 
information  about  how  the  file  is  to  be  protected  from  un¬ 
authorized  access,  accounting  information  to  permit  charging 
for  the  storage  of  the  file,  and  so  forth.  Moreover,  a  file 
service  must  provide  more  operations  than  storage  and  retrieval: 
enumerating  names  of  files  stored,  deleting  or  renaming  files, 
moving  files  from  one  storage  medium  to  another,  etc.  The 
HLP  must  select  among  these  operations,  provide  the  informa¬ 
tion  needed  by  them,  and  report  errors  encountered  as  the 
operations  progress,  in  addition  to  simply  transmitting  file  data. 

A  resource-sharing  network  becomes  increasingly  valuable  as 
more  HLP’s  that  offer  services  are  designed  and  implemented. 
Thus  an  overall  objective  of  protocol  development  is  to  seek 
methods  and  tools  that  simplify  HLP  design  and  implementa¬ 
tion.  Already,  experience  with  HLP's  has  uncovered  common 
design  techniques,  some  of  which  are  reviewed  in  this  paper. 
But  techniques  for  specifying  HLP’s  and  for  implementing 
them  remain  primitive.  We  argue  that  methodology  and  tools 
for  designing  and  implementing  HLP’s  can  be  improved  by 
applying  notions  developed  for  HLL’s-in  effect  extending  the 
HLL  into  a  distributed  environment.  Just  as  the  HLL  frees 
the  programmer  from  many  tiresome  details  of  programming  a 
particular  computer,  the  extension  frees  the  programmer  from 
the  tiresome  details  of  dealing  with  network  communications. 

To  construct  such  an  argument,  a  better  understanding  of 
actual  HLP’s  is  required.  The  remainder  of  this  introductory 
section  illustrates  the  range  of  HLP's  used  in  a  resource-sharing 
network.  Section  II  builds  a  detailed  understanding  of  the  in¬ 
gredients  and  requirements  of  HLP’s  by  defining  a  model  and 
characterizing  some  very  different  protocols  in  terms  of  the 
model.  Section  HI  reviews  the  issues  faced  in  the  design  of 
HLP's,  and  common  techniques  used  in  the  designs.  Finally, 


0018-9219/78/1 100-1371  S00  75  ©  1978  IEEE 

78 


1372 


Fig.  l.  A  resource-sharing  network.  “User  processes”  obtain  services 
from  “server  processes."  under  the  control  of  HLP's.  Heavy  lines  link¬ 
ing  processes  in  the  illustration  denote  the  logical  connections  that 
carry  HLP.  These  connections  are  unrelated  to  the  physical  organiza¬ 
tion  of  the  network. 


Section  IV  shows  how  many  of  the  needs  of  HLP's  can  be 
addressed  by  HLL  facilities. 

A.  The  Role  of  HLP's 

HLP’s  are  used  in  two  slightly  different  ways  in  networks. 
In  the  first  use,  the  basis  of  resource-sharing  networks,  HLP’s 
control  the  services  offered  by  resources  in  the  network.  Vari¬ 
ous  sites  in  the  network  implement  server  processes  that  offer 
a  specific  service  to  any  user  process  that  communicates  re¬ 
quests  to  it  (Fig.  1).  In  these  settings,  activities  are  instigated 
by  the  user  process;  the  server  is  a  willing  slave.  Examples  of 
important  services  are  as  follows. 

al  Remote  iob  entry  service:  A  user  process  submits  to  the 
server  a  file  of  text  that  controls  running  batch  jobs  (job  con¬ 
trol  language).  Later,  the  user  process  may  retrieve  from  the 
job  entry  server  the  “listing”  text  produced  as  a  result  of  the  job. 

b)  Remote  terminal  service:  A  user  process  can  “login”  to 
a  timesharing  system  service,  and  subsequently  send  and  re¬ 
ceive  characters  as  if  it  were  a  terminal  on  the  system.  We  shall 
use  the  term  “TELNET”  for  this  service,  adopting  the  name 
of  the  protocol  used  in  the  ARPA  network  [7) . 

c)  File  service:  A  user  process  can  send  data  files  to  the 
service  for  storage  and  subsequent  retrieval.  The  file  service 
may  be  provided  with  conventional  disk  equipment,  or  may 
have  special  file-storage  resources  such  as  a  mass  memory. 

d)  Mail  service :  A  service  accepts  text  messages  addressed 
to  individual  people.  Later  the  individuals  can  interrogate  the 
service  through  a  user  process  to  receive  any  mail  that  has 
been  left  for  them. 

e)  Resource-sharing  service:  This  service  accepts  requests 
from  user  processes  to  allocate  resources  of  the  computer  sys¬ 
tem  on  which  the  service  executes  (jobs,  processes,  files,  I/O 
devices),  and  makes  these  resources  available  to  other  processes 
in  the  network  [361- 

Each  of  these  services  has  a  corresponding  HLP  for  accessing 
and  controlling  the  server  process.  These  services  are  analogous 
to  the  services  that  are  offered  on  a  single  computer  by  an 
operating  system.  A  mechanism  for  storing  files,  for  example, 
is  a  vital  service  provided  by  an  operating  system.  Extending 
this  service  into  a  resource-sharing  network  allows  greater 
sharing  of  the  file  data,  as  well  as  economical  storage  of  files 
with  specialized  hardware.  Thus  HLP’s  are  not  entirely  new 
phenomena -similar  concepts  have  existed  within  operating 
systems-but  are  now  being  revealed  as  they  are  extended 
into  networks. 

A  second  use  for  HLP's  links  in  a  symmetric  fashion  two 
processes  that  are  part  of  a  single  application.  Such  a  link  may 
be  the  result  of  the  division  of  a  software  system  into  separate 
modules.  Or  it  may  be  that  the  communicating  parties  them- 
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selves  are  symmetric,  as  is  the  case  in  the  network  voice  proto¬ 
col  described  in  Section  II-D.  In  these  cases,  the  user-server 
distinction  is  not  relevant.  Thus  applications  may  use  HLP’s 
both  to  access  services  and  to  communicate  among  parts  of 
an  application. 

HLP’s  offer  many  new  opportunities  to  distribute  applica¬ 
tions.  An  application  can  be  divided  into  processes,  each  of 
which  can  be  executed  on  the  computer  best  suited  to  its 
needs:  signal  processing  tasks  are  accomplished  on  fast  parallel 
machines;  symbolic  computation  on  computers  with  large 
address  spaces  suited  to  list  processing.  Processes  that  com¬ 
municate  directly  with  a  human  by  voice,  graphical,  or  textual 
information  can  travel  with  the  user,  communicating  with 
collaborating  processes  in  the  network  by  packet  radio  links 
[19].  Distributed  applications  can  evolve  as  requirements 
change,  by  modifying  HLP’s,  adding  new  services,  distributing 
the  computing  activity  among  a  wider  set  of  resources,  etc. 
The  ease  with  which  HLP’s  can  be  designed  and  implemented 
will  control  the  extent  to  which  these  opportunities  can  be 
fulfilled. 

B.  Standardization 

Achieving  the  standardization  required  to  offer  services  on  a 
variety  of  computers  is  a  major  goal  of  HLP’s.  Before  the  ap¬ 
pearance  of  large  computer  communication  networks,  HLP’s 
were  largely  a  private  matter.  A  programmer  who  wanted  to 
design  an  application  consisting  of  several  processes  and  wanted 
to  provide  a  method  for  communicating  among  them,  had  only 
to  establish  some  personal  conventions  for  the  interprocess 
communication.  Networks  spoiled  this  simple  approach.  Com¬ 
munication  protocols  became  a  matter  of  public  decision  in 
the  hope  that  a  single  standard  design  could  serve  a  wide  com¬ 
munity  of  computers  and  users;  interconnecting  a  variety  of 
computers  unleashed  a  barrage  of  problems  due  to  different 
representation  conventions  in  different  machines  (word  lengths, 
character  sets,  and  the  like)  and  in  different  operating  systems 
(files,  naming,  and  protection  mechanisms);  networks  forced 
slower  communication  among  processes  than  did  tightly 
coupled  systems  and  seemed  to  magnify  problems  of  syn¬ 
chronizing  processes. 

The  importance  of  standardization  is  demonstrated  by  the 
“n7  problem."  Imagine  a  network  of  n  computers,  each  of 
which  offers  a  mail  server  that  uses  a  distinct  HLP.  Each  com¬ 
puter  will  require  a  separate  user  program  to  send  and  receive 
mail  for  each  protocol-a  total  of  n  programs  on  each  com¬ 
puter  or  n 7  in  all.  By  contrast,  if  the  mail  servers  all  imple¬ 
ment  a  standard  HLP  for  sending  and  receiving  mail,  only  n 
user  programs,  one  on  each  computer,  are  required  to  obtain 
access  to  all  mail  services  in  the  network.  The  high  cost  of 
writing  and  maintaining  software  makes  protocol  standardiza¬ 
tion  essential.  The  techniques  and  conventions  that  permit 
standardization  over  a  range  of  computing  equipment  are 
a  major  ingredient  in  the  design  of  HLP’s. 

0.  Three  High-Level  Protocols 

This  section  describes  three  HLP’s  as  a  prelude  to  a  more 
general  discussion  of  issues  involved  in  designing  and  imple¬ 
menting  HLP’s.  These  examples  are  couched  in  terms  of  * 
model  of  three  components  of  an  HLP:  language,  coding,  and 
transport.  The  first  example,  a  plotter  protocol,  illustrates  the 
model.  The  tvo  remaining  examples,  a  graphics  protocol  and 
a  voice  protocol,  are  actual  HLP’s  that  demonstrate  a  range  of 
HLP  design  techniques. 
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.-1.  The  Components  of  an  HLP 

Designing  HLP’s  seems  at  first  to  require  a  large  number  of 
decisions,  ranging  from  communication  formats  to  application 
requirements.  The  design  problem  can  be  decomposed  into 
three  aspects  of  HLP’s:  the  language,  the  • coding ,  and  the 
transport  mechanism. 

Language :  The  language  describes  the  functional  intent  of 
the  communications  exchanges  between  processes:  What  does 
one  process  “say”  to  the  other?  What  responses  does  the 
other  process  generate?  How  does  one  process  cause  the  other 
to  alter  its  state ?  The  language  may  include  commands  or 
other  control  information,  and  will  usually  include  data  as 
well.  The  designer  of  a  language  to  offer  a  service  must  antici¬ 
pate  how  other  processes  will  use  this  service,  and  provide 
within  the  language  the  means  for  those  processes  to  request 
and  receive  the  service  they  need. 

Coding:  Any  communication  between  two  processes  must 
be  coded  into  a  collection  of  bits  that  the  processes  know  how 
to  generate  and  interpret.  For  example,  if  the  language  re¬ 
quires  transmission  of  a  character  string,  it  may  be  coded  into 
a  sequence  of  8-bit  bytes  according  to  the  ASCII  standard,  an 
integer  value  into  a  32-bit  two's  complement  binary  representa¬ 
tion,  and  so  forth.  Just  as  data  structure  declarations  in  an 
HLL  impose  structure  on  an  otherwise  homogeneous  memory, 
coding  imposes  structure  on  a  previously  homogeneous  com¬ 
munications  path. 

Transport :  The  communication  path  is  provided  by  the 
transport  mechanism,  the  means  to  collect  bits  from  one 
process  and  deliver  them  to  another.  If  both  processes  reside 
in  the  same  computer,  this  mechanism  is  provided  by  the  oper¬ 
ating  system  or  by  the  language  environment  in  which  the 
processes  execute  (e.g.,  “pipes”  in  Unix  (281 ).  If  the  processes 
execute  on  separate  machines,  the  transport  may  be  provided 
by  a  network.  We  prefer  to  avoid  stating  which  processes  exe¬ 
cute  in  which  computer  and  consequently  which  transport  is 
provided  by  networks  and  which  by  other  means.  We  may  be 
concerned  with  performance  properties  of  the  transport 
mechanism  (e.g.,  speed,  delay,  buffering,  errors),  but  not  with 
the  communications  methods  themselves.  As  a  consequence 
of  this  view,  we  must  assume  that  HLP’s  cannot  depend  on  the 
processes  sharing  other  resources,  such  as  memory  or  files:  the 
protocol  must  provide  all  of  the  interprocess  communication. 
We  shall  use  the  term  message  to  refer  to  the  individual  pack¬ 
ages  of  bits  that  are  transmitted  from  process  to  process.  A 
network  will  typically  use  additional  messages  to  help  the 
transport  mechanism  operate  smoothly:  these  lower  level  pro¬ 
tocols  are  concerned  with  flow  control,  error  recovery,  and 
the  like  (for  more  discussion  of  lower  level  or  “host-host” 
protocols,  see  [  1  ] ,  (2) ,  [26( ). 

When  designing  and  describing  HLP’s,  it  is  important  to 
separate  these  three  aspects  of  interprocess  communication 
(IPC).  The  language  can  be  likened  to  HLL  constructs  involv¬ 
ing  control  and  data:  procedure  calls,  coroutine  calls,  etc. 
Coding  is  analogous  to  choosing  a  computer  representation 
based  on  collections  of  bits:  integers,  text  characters,  or  com¬ 
puter  instructions.  The  design  of  the  transport  mechanism  is 
concerned  with  issues  such  as  bandwidth,  delay,  and  error  re¬ 
covery,  and  with  the  low-level  protocols  that  provide  the  trans¬ 
port.  If  these  three  aspects  are  not  kept  separate,  and  we 
think  of  HLP’s  as  configurations  of  bits  flowing  along  com¬ 
munications  lines,  we  will  become  hopelessly  confused.  Imag¬ 
ine  trying  to  describe  how  a  PDP-1 1  works  by  describing  only 
bit  patterns  and  signal  transitions  on  the  Unibus! 


Fig.  2.  Two  processes  using  the  plotter  protocol.  The  application  sends 
messages  to  the  plotter  process  that  completely  describe  the  figures 
to  be  plotted.  The  two  processes  may  execute  on  separate  computers 
linked  by  a  network,  or  on  a  single  computer  that  provides  message 
communication  between  the  two  processes. 

The  three  components  are  not  independent,  but  interact  in 
ways  that  must  be  addressed  by  the  HLP  designer.  Transport 
bandwidths  or  errors  may  affect  the  choice  of  coding  tech¬ 
niques.  Language  choices  often  affect  coding,  and  can  also 
impact  the  design  of  transport  methods.  In  the  sections  that 
follow,  some  of  these  interactions  are  demonstrated  by  the 
three  example  HLP’s. 

B.  A  Plotter  Protocol 

We  shall  illustrate  in  this  section  a  simple  HLP  for  controlling 
a  pen  plotter.  Such  a  protocol  could  be  used  to  offer  “plotting 
services”  in  a  distributed  system:  one  or  more  “plotter  server” 
processes  are  provided,  each  controlling  plotter  hardware  in 
response  to  commands  and  data  provided  by  the  plotter  pro¬ 
tocol.  In  order  to  make  plots,  an  application  process  such  as  a 
curve-fitting  package  establishes  communication  with  a  plotter 
process  that  will  provide  the  plotting  services.  Fig.  2  shows 
the  logical  arrangement  of  these  processes. 

Language:  The  language  for  the  plotter  HLP  is  designed  to 
allow  the  application  program  to  produce  drawings  by  issuing 
commands  to  the  plotter  process.  The  application  program 
needs  to  be  able  to  start  a  fresh  plot,  to  raise  and  lower  the 
plotter  pen,  and  to  draw  straight  lines.  These  commands  in 
the  language  might  be  provided  by  procedures  that  are  called 
by  the  application  program: 

BeginPlot  Procedure  to  place  a  new  sheet  of  paper  in  the 
plotting  area. 

EndPlot  Procedure  to  signal  that  a  plot  is  complete. 

Up  Procedure  to  raise  the  plotter  pen  so  that  it  no 

longer  touches  the  paper. 

Down  Procedure  to  lower  the  plotter  pen  so  that  it 

will  mark  on  the  paper. 

LineTo(x,y)  Procedure  to  cause  the  plotter  pen  to  move 
from  its  present  location  in  a  straight  line  to 
the  point  (x,y).  The  Cartesian  coordinates 
specify  a  location  on  the  paper,  in  units  of 
centimeters  from  the  lower  left  comer. 

Notice  that  these  procedures  show  no  effect  of  the  presence  of 
a  network -the  same  procedures  could  be  used  to  control  a 
plotter  connected  directly  to  the  application  program. 

Coding:  In  order  to  transmit  to  the  plotter  process  the  com¬ 
mands  invoked  by  the  application  process,  each  command 
must  be  encoded  as  a  sequence  of  bits.  For  our  example,  we 
shall  use  8-bit  ASCII  characters  as  the  basis  for  the  coding.  A 
command  is  designated  by  a  single  character  corresponding  to 
the  procedure  name  (B,  E,  U,  D,  L).  The  (x.y)  arguments  to 
the  “L"  command  can  be  represented  as  two  four-digit  num¬ 
bers  in  units  of  1/100  cm.  Thus  the  protocol  sequence 

B  U  L  0100  0100  D  L  0200  0100  L  0200  03 50  L  0100  0100  E 

will  plot  a  triangle  with  vertices  (1,1),  (2,1),  and  (2.1.51  cm. 
Of  course,  another  encoding  could  be  devised  that  requires 
fewer  bits  to  code  the  same  language.  In  any  case,  very  little 
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computation  is  required  to  encode  and  decode  the  plotter 
language. 

Transport:  The  transport  system  must  deliver  the  individual 
8-bit  bytes  to  the  plotter  process.  If  a  packet-switched  net¬ 
work  is  used  for  transport,  it  is  sensible  for  the  application 
process  to  pack  several  dozen  bytes  into  a  message  before  the 
message  is  transmitted.  However,  the  EndPlot  procedure  must 
force  the  transmission  of  the  last  message,  even  if  it  is  only 
partially  full,  to  insure  that  the  final  “E”  code  is  delivered  to 
the  plotter  process. 

-  For  the  plotter  protocol,  a  general-purpose  interprocess 
transport  mechanism  such  as  TCP  [2]  is  ideal.  The  transport 
reliability  obtained  from  such  a  mechanism  is  essential  for  the 
plotter  protocol:  errors  in  the  data,  or  disorder  in  the  sequence 
of  messages  decoded  by  the  plotter  process  will  have- disastrous 
effects.  However,  the  protocol  requires  neither  extremely  high 
transmission  rates  nor  extremely  short  transmission  delays,  be¬ 
cause  most  plotters  are  quite  slow. 

Discussion:  The  plotter  protocol  illustrates  two  important 
aspects  of  HLP’s:  standardization  and  device  independence.  A 
resource-sharing  network  may  offer  several  plotter  server 
processes,  all  of  which  use  a  standard  plotter  protocol.  Stan¬ 
dardization  is  required  for  all  components  of  the  HLP: 
language,  coding,  and  transport.  The  method  of  packing  bytes 
into  messages  and  the  low-level  protocols  used  to  achieve 
reliable  transport  of  messages  must  be  standardized.  Coding 
is  standardized  by  the  ASCII  character  standard  and  the 
definition  of  character  sequences  used  by  the  plotter  pro¬ 
tocol.  Finally,  the  concepts  of  the  language  are  standardized: 
Up,  Down,  and  LineTo  are  but  one  way  to  control  a  plotter, 
but  they  are  chosen  as  the  standard.  The  language  standard 
imposes  a  method  for  controlling  the  plotter  that  may  have 
far-reaching  ramifications  in  the  structure  of  the  application 
program. 

Standardization  begets  generalization:  if  a  single  standard  is 
to  be  designed,  it  should  be  able  to  cope  with  as  many  similar 
uses  as  possible.  This  property  is  often  called  “device  inde¬ 
pendence,”  although  a  physical  device  is  not  always  involved. 
The  idea  is  to  make  the  protocol  independent  of  particular 
hardware  details  by  abstracting  the  elements  common  to  the 
control  of  a  range  of  devices.  The  plotter  protocol  described 
above  is  somewhat  device  independent:  plotters  with  various 
resolutions  (e.g.,  100  steps  per  centimeter,  173  steps  per 
centimeter)  can  clearly  be  used— the  plotter  process  converts 
the  standard  units  of  1/100  cm  into  the  coordinate  system 
used  by  the  hardware  it  controls.  Electrostatic  plotters  or 
storage-tube  displays,  even  though  they  have  no  pen  to  be 
raised  and  lowered,  can  also  be  correctly  controlled  by  the 
protocol  because  the  only  effect  of  protocol  commands  is  to 
describe  visible  lines.  However,  colored  pens  on  a  plotter  can- 
lot  be  handled  properly  by  our  protocol— it  is  not  general 
enough  to  handle  more  than  one  color. 

The  protocol  illustrates  a  style  of  HLP  design  that  derives 
the  language  from  a  standard  subroutine  interface.  Each  pro¬ 
cedure  call  causes  a  record  of  the  procedure  name  and  argu¬ 
ments  to  be  encoded  and  sent  through  the  network.  The  re¬ 
ceiver  decodes  the  messages  and  usually  calls  corresponding 
procedures  that  actually  drive  the  plotter.  The  network  thus 
provides  a  binding  mechanism  by  which  calls  to  the  five  pro¬ 
cedures  in  the  application  program  are  converted  into  calls  to 
procedures  in  the  plotter  process.  Ideally,  the  network  is 
“transparent”  to  the  application  program  -its  presence  cannot 
be  detected! 


Fig.  3.  An  application  using  two  HLP’s.  A  person  uses  the  TELNET 
service  to  control  a  curve-fitting  application  running  on  a  time-sharing 
system.  This  program,  in  turn,  uses  a  high-level  graphics  protocol  to 
control  a  display  that  shows  the  curve-fitting  results. 


The  plotter  protocol  is  extremely  simple  compared  to  most 
real  HLP’s,  which  attempt  greater  device  independence  and 
which  require  bidirectional  communication  between  processes. 
The  following  two  sections  describe  two  actual  HLP’s  de¬ 
veloped  by  the  ARPA  network  community:  a  network  graphics 
protocol  (NGP)  and  a  network  voice  protocol  (NVP).  Because 
these  two  descriptions  are  rather  long,  the  reader  may  prefer 
to  skip  to  Section  III,  which  contains  a  summary  of  techniques 
used  in  designing  HLP’s. 

C.  A  Network  Graphics  Protocol  (NGP) 

The  objective  of  an  NGP  is  to  provide  a  standard  way  for  an 
application  program  to  control  an  interactive  graphics  display. 
It  is  a  natural  extension  of  the  idea  of  the  TELNET  protocol 
which  allows  an  alphanumeric  terminal  connected  to  one  com¬ 
puter  to  act  as  a  terminal  to  another.  A  user  with  a  graphics 
display  and  computer  capable  of  obeying  the  NGP  would  have 
access  to  a  potentially  large  number  of  network  resources  that 
use  graphical  interaction  (Fig.  3).  Because  users  with  different 
display  devices  will  want  to  use  these  services,  the  NGP  design 
must  be  independent  of  the  particular  device  being  used. 

The  plotter  protocol  developed  in  the  previous  section  could 
be  used  as  an  NGP:  it  can  control  the  generation  of  line-drawing 
images  on  a  display.  If  a  storage-tube  display  is  used,  the  com¬ 
mands  to  the  hardware  are  almost  identical  to  those  provided 
by  the  protocol.  But  an  NGP  needs  greater  generality,  includ¬ 
ing  the  ability  to  handle  a  wide  variety  of  display  hardware,  to 
provide  fast  interaction,  and  to  cope  with  different  kinds  of 
graphical  input  devices. 

The  methodology  we  find  appropriate  for  developing  an 
NGP  is  to  first  design  a  device-independent  graphics  package 
(or  copy  an  existing  design)  and  then  adapt  it  to  network  use. 
The  device-independent  design  wrestles  with  the  functional 
capabilities  of  the  system  and  the  model  that  the  application 
program  uses  to  invoke  them.  The  network  adaptation  then 
need  address  only  coding  and  transport. 

The  particular  NGP  described  in  this  section  was  developed 
for  use  by  the  ARPA  network  [33],  [14].  It  is  based  on  a 
"general-purpose  graphics  package”  [31],  [40],  which  at¬ 
tempts  to  cover  a  range  of  applications  requiring  interactive 
line-drawing  displays.1  We  cannot,  in  this  paper,  take  up  the 
many  issues  that  surround  the  design  of  such  a  package  (see 
[24],  [25]).  However,  we  will  discuss  how  network  com¬ 
munications  interact  with  the  design.  A  summary  of  the 
protocol  is  given  in  Table  1. 

Language:  The  functions  of  a  graphics  package  can  be  di¬ 
vided  into  three  categories:  those  for  defining  an  image  to  be 

1  Other  protocols  hive  been  proposed  for  graphical  needs  that  lie  out¬ 
side  the  NGP’S  objectives:  carefully  formatted  text  ( 32  |  and  sampled 
images  such  as  might  be  csptured  with  a  TV  camera  I ZZ ) . 
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TABLE  I 

Summary  of  ARP  A  Network  Graphics  Protocol  Each  Has  the 
Same  Format,  a  Command  Followed  by  Arguments.  The  Arguments 
Are  Listed  Inside  Brackets  ((  )).  Details  of  the  Protocol  Are 
Found  in  (33J 


control-. 

scf..op«n  <*cg.n«m** 
i**C  dot* 

SCR  poll 

scr.-unpott  <%cp..n«#ne» 

SCR  Kill 

SCR  Append  <*«*.namq* 

end  bslclvol. updates 

seR  dot  ^.coordinate*  <y -coordinate* 
scR.move  <«.coorduuta»  <y.coordin*l«* 
see  draw  <«xoordinale»  <y. coordinate* 
seg.lexl  <le*t.alnnf* 

sel.mtensity  <count* 
set. type  <cooot* 

set. char scier . orient  attofidtscrata  <count» 
sei  character.ortenlatKKtconlinuous  <tr action* 
sct.charactcr.si/e.d'scrcte  <count> 
set  character  .size. continuous  <f  faction* 

change. attribute  <seg.name*  <attnbute» 

-possible  <at tribute*  arguments: 

<set.hiRhtitHt.il tribute*  <on/off» 

<sel  hit  sensitivity,  at tribute*  <on/o<t* 

< Set. intensity. attribute*  <count> 

<set.screen.seiect.attnbute»  <masK* 

< sc t.posifKjn. attribute*  <*.coordm#te»  <y  'oord'"ate> 

seg.r  e  adbactuseftist 

set  readbecK-scg  <sef  name* 

Positioned  test: 

ptevt.open  <p test. name*  <counl>  <s  coordinate*  <•.  coordinate* 

<*  coordinate*  <y.coordinale>  <count> 
plevt.Kitl  <pte«t. name* 
ptcxt.set  <plest  name*  <masK» 

ptest  scroll  up  <ple«t.name>  <slnng  number*  <str*^  number* 
pi**!. scroll-down  «ptest.name>  <stnnt.number*  <v-ng.mjmbar» 
ptcst.move  <plesl.name»  *  string  number*  <ple»frj-Ne*  Mtrrng  number* 
ptCK(  e<*t  <ptes(.name*  <stnng  number*  'post*  <r  :*2»  <taaf.string* 
ptewt  modify  <ptest  name*  <slring.number>  <posl*  <pos?>  <ptest.featura> 
ptext  remote  edit  <pte*t.name*  <pte*t.strint> 

Input  (unctions: 

input. enable  <techmoue  number*  <disable  con-itio-*  <racort.svquaftCa* 

input. disable  < technique  number* 

input. report  < technique  number*  * technique  lummay* 

input.reset.syslem 

input  dr  ag.se t  <tct  hmque. number  *  <seRnamc> 

Inquiry; 

inquire 

inqutre  response  <<ount>  Response  phrase*  - 
-sample  'response  phrase >s  ar#: 

^.implemented  commands*  <masK> 

<i.scrcen  coordinates*  <»,lelt»  <y  bottom*  <s.f'Rht>  <y.top*  <COunt» 
<i.  screen,  si  re*  <ta«t  string*  <t««t  string* 

<i. screen  number*  <count» 

<4.ler>mnat  name*  steal. string* 

<i.term»nal  type*  <t«rm»nat  type  value* 

^.intensities*  <cov>nt* 

<i.Hrw  type*  <<Oun|* 

<i.eharaclers>  <mesK> 

^.character  onenljtions*  <<ount>  <frac(«on*  — 

<i.characler  sue*  <count»  <character.sir#  description*  _ 

<t.ava«l  able  input  faebniqua*  *  technique  number*  <tert.«fnng*  - 

Miscellaneous: 
eccape  protocol  Mast  stnng» 
synchronize  «counl* 
reset 

error. string  <te*|  string* 


displayed,  those  for  accepting  responses  from  the  user  via 
graphical  input  devices,  and  those  that  inform  the  application 
program  of  important  properties  of  the  display.  The  discussion 
of  the  NGP  language  is  likewise  divided  into  considerations  of 
output,  input,  and  inquiry. 

Language -Output:  One  of  the  most  critical  requirements 
of  an  interactive  graphics  system,  and  one  that  might  be  com¬ 
promised  by  a  network,  is  the  speed  with  which  the  display 
can  be  altered-this  is  the  crux  of  many  interactive  dialogues. 
This  consideration  forces  us  to  abandon  the  plotter  protocol: 
to  make  a  small  change  to  a  display  of  2000  lines  would  re¬ 
quire  transmitting  a  description  of  the  entire  image  (roughly 
40  000  bits,  even  with  a  fairly  efficient  coding).  Furthermore, 
such  a  technique  burdens  the  application  program  by  making 
it  recompute  the  image  from  its  application  data  structure. 

An  effective  technique  that  solves  these  problems  is  the 
segmented  display  file.  Rather  than  manipulate  one  large 
description  of  the  image,  the  graphics  system  maintains  a  set 
of  independent  segments  that  together  comprise  the  display. 
Each  segment  is  assigned  a  numeric  “name"  for  identification, 
and  contains  a  list  of  graphic  primitives  (lines  or  characters) 
that  are  part  of  the  image.  Each  segment  also  has  a  flag 
(on, off)  that  indicates  whether  the  primitives  described  in  the 
segment  are  to  be  displayed  on  the  screen.  The  system  pro¬ 
vides  facilities  for  specifying  the  contents  of  a  specific  seg¬ 
ment,  and  for  altering  the  Hag.  To  create  a  segment  named  8 
that  contains  a  description  of  the  triangle  in  the  plotter  ex¬ 
ample,  the  application  might  call  the  following  procedures  in 
the  graphics  package : 

OpenSegment(8)  Says  primitives  are  to  become  segment  8. 

MoveT o(l,l)  Primitives  defining  triangle . 

DrawTo(2,l) 

DrawTo(2,3.5) 

DrawTo(l,l) 

CloseSegmentO  Ends  construction  of  segment  8. 

PostSegment(8)  Sets  segment  8  flag,  so  it  will  be  displayed. 

The  calls  simply  build  a  display  file  that  a  hardware  display 
processor  interprets  to  refresh  the  display.  Changes  to  the  dis¬ 
play  file  as  a  result  of  these  calls  are  reflected  as  changes  in  the 
image  on  the  screen. 

The  segment  thus  becomes  the  unit  of  information  that  is 
easily  changed  with  each  interaction.  It  is  not  difficult  to 
write  most  application  programs  so  that  the  segments  corre¬ 
spond  to  logically  separate  parts  of  the  display  in  such  a  way 
that  display  alterations  are  often  limited  to  a  small  fraction  of 
ail  the  segments.  If  information  within  a  segment  must  be 
changed,  the  segment  is  entirely  rebuilt  with  calls  such  as  those 
above.  Or  the  segment  may  be  deleted  if  its  contents  are  no 
longer  needed.  Or  the  information  in  a  segment  can  be  re¬ 
tained  for  possible  use  later.  This  technique  is  useful  for  re¬ 
curring  imagery  such  as  messages  and  menus. 

The  graphics  package  also  provides  an  “update"  function. 
The  application  program  calls  this  function  to  identify  points 
at  which  the  display  must  accurately  reflect  all  changes  that 
have  been  requested  by  calls  on  the  graphics  system.  These 
points  usually  directly  precede  requests  for  new  user  input: 
the  user  must  view  an  up-to-date  display  to  formulate  his  in¬ 
put.  This  function  plays  an  important  role  in  device  inde¬ 
pendence,  providing  good  service  to  storage-tube  displays  by 
reducing  the  number  of  screen  erasures  [31). 

On  a  single  computer,  a  device-independent  graphics  package 
of  this  sort  is  implemented  as  shown  in  Fig.  4(a).  The  package 
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(a) 


(b) 

Fig.  4.  The  organization  of  a  device-independent  graphics  package, 
(a)  On  a  single  computer,  the  software  is  divided  into  three  modules: 
the  application  (A),  the  device-independent  module  (Di),  and  the 
device-dependent  module  (Dd).  (b)  The  system  can  be  adapted  for  a 
network  implementation  by  using  communications  to  provide  the 
linkage  between  Di  and  Dd. 

is  divided  into  two  modules,  one  device  independent  (Di),  and 
one  device  dependent  (Dd).  Each  call  by  the  application  pro¬ 
gram  (A)  is  subjected  to  some  Di  processing,  and  then  usually 
results  in  a  call  on  the  Dd  module.  The  Dd  module  is  re¬ 
sponsible  for  building  the  segments  themselves  in  a  form  that 
can  be  interpreted  by  the  particular  display  processor  in  use. 
The  Di  module  can  be  used  with  many  different  Dd  modules 
with  the  same  procedure  interface— in  this  way,  the  application 
program  can  operate  a  variety  of  different  display  devices. 

Adapting  this  configuration  to  a  network  environment  is 
easy  (Fig.  4(b)).  The  module  resides  with  the  application  pro¬ 
gram  and  the  Dd  module  runs  in  the  computer  that  directly 
controls  the  display  hardware.  Network  communications  are 
used  to  achieve  the  effect  of  the  Di  module  calling  procedures 
in  the  Dd  module.  Because  the  procedure  interface  between  the 
Di  and  the  Dd  pans  is  standardized  to  allow  alternative  Dd 
modules,  the  standardization  is  easily  retained  in  the  network 
protocol.  Thus  the  segmented  display  file,  a  mechanism  de¬ 
vised  to  solve  problems  of  interactive  graphics  programs  in 
single  computers,  turns  out  to  be  adapted  easily  to  networks. 

Language-Input:  The  NGP  provides  access  to  a  variety  of 
graphical  input  devices,  for  it  is  the  interaction  afforded  by 
these  devices  that  makes  graphics  an  important  tool  in  many 
applications.  Unfortunately,  achieving  device  independence 
for  input  is  much  harder  than  for  output,  in  part  because  of 
the  enormous  range  of  input  devices.  An  additional  problem 
is  that  effective  use  of  input  devices  requires  generating  on  the 
display  screen  various  feedback  images,  such  as  cursors,  to 
allow  the  user  to  coordinate  his  input  with  images  already  dis¬ 
played.  The  range  of  feedback  techniques  is  even  wider  than 
the  range  of  input  devices! 

An  approach  to  device  independence  is  to  concentrate  on  the 
interactive  techniques  being  used  rather  than  on  particular  in¬ 
put  devices.  The  display  hardware  and  Dd  module  cooperate 
closely  to  provide  rapid  feedback  to  implement  the  technique. 
After  the  interaction  is  complete,  the  Dd  module  reports  to 
the  application  program  a  terse  summary  containing  essential 
details  of  the  entire  interaction.  For  example,  three  techniques 
in  the  repertoire  are  the  following. 

/ )  Positioning:  The  user  is  expected  to  identify  a  spot  on 
the  screen.  He  uses  a  coordinate  input  device  to  steer  a  dis¬ 
played  cursor  to  the  desired  spot,  and  then  depresses  a  button 
to  indicate  “this  is  the  point  I  mean.”  The  summary  returned 
to  the  application  program  contains  only  the  coordinates  of 
the  cursor  when  the  button  was  struck. 

2)  Stroke  collection:  The  user  is  expected  to  draw  a  trace 
on  the  screen  by  steering  the  cursor  at  the  same  time  the  but¬ 
ton  remains  depressed  (the  button  is  usually  a  switch  on  a 
stylus  input  device).  The  display  shows  the  resulting  trace  as 


feedback,  which  presents  an  effect  similar  to  drawing.  The 
summary  reported  to  the  application  program  is  a  list  of  points 
that  lie  on  the  trace. 

3)  Pointing:  The  user  is  expected  to  identify  an  item  al¬ 
ready  displayed  on  the  screen.  One  way  to  implement  this 
technique  is  to  steer  a  cursor  over  the  item  and  push  a  button. 
Alternatively,  a  light  pen  can  be  used  to  identify  the  item. 
The  summary  reported  contains  a  specification  of  which  item 
was  identified. 

This  entire  approach  focuses  on  input  techniques  rather  than 
on  devices  alone.  It  may  limit  the  flexibility  of  the  application 
program  in  using  some  device  in  a  clever  way.  However,  the 
flexibility  is  available  instead  to  the  Dd  module,  which  can 
implement  the  techniques  in  various  ways,  and  can  perhaps 
customize  the  implementation  for  a  particular  user. 

Language  -Inquiry :  Not  all  aspects  of  the  interface  to  the 
Dd  module  are  completely  standardized  ;  some  are  parametrized. 
For  example,  some  displays  allow  lines  to  be  distinguished  by 
their  brightness,  although  devices  differ  in  the  number  of 
brightnesses  provided.  A  standard  that  assumes  only  one 
brightness  prevents  the  use  of  this  technique  entirely.  One 
that  assumes  a  fixed  number  of  brightnesses  will  cause  con¬ 
fusion  on  a  display  that  provides  but  one.  A  solution  to  prob¬ 
lems  of  this  sort  is  to  let  the  application  program  or  Di 
module  inquire  about  features  of  the  display  hardware,  and 
thereafter  drive  the  display  accordingly.  Examples  of  pa¬ 
rameters  that  can  be  handled  this  way  are:  number  of  bright¬ 
nesses,  number  of  line  types  (e.g.,  dotted,  dashed),  physical 
screen  dimensions,  and  the  kinds  of  input  devices  available. 

Some  inquiry  results  can  have  a  profound  effect  on  the  oper¬ 
ation  of  the  application  program.  Many  features  of  the  proto¬ 
col  are  optional  and  may  not  be  implemented  by  the  hardware 
or  by  the  Dd  module;  inquiry  establishes  which  features  are 
absent.  Another  sort  of  inquiry  reveals  the  type  of  display 
being  used:  is  it  a  storage  tube  or  a  refreshed  display?  The 
application  program  may  choose  entirely  different  interaction 
styles  for  these  two  display  types,  because  changes  to  storage- 
tube  images  are  much  slower  than  changes  to  refreshed  images. 

Coding:  The  coding  for  the  NGP  is  very  simple:  we  encode 
procedure  calls  from  the  Dd  module  to  the  Di  module  as  a  se¬ 
quence  of  8-bit  bytes.  The  first  byte  identifies  the  procedure. 
Subsequent  bytes  provide  arguments;  each  procedure  has  an 
associated  format  for  arguments.  The  protocol  was  designed 
to  avoid  floating-point  numbers;  only  integers  are  used. 

Information  returned  by  the  Dd  module  to  the  Di  module  is 
structured  symmetrically,  as  if  the  Dd  module  were  calling 
procedures  in  the  Di  module  to  answer  queries,  to  report  inter¬ 
action  results,  and  so  forth. 

Transport:  The  coding  generates  a  stream  of  bytes  to  be 
transmitted,  just  as  the  plotter  protocol  does.  Because  most  of 
the  encoded  procedure  calls  do  not  return  answers,  the  bytes 
can  be  packed  into  messages;  messages  are  transmitted  when 
full.  It  is  occasionally  necessary  to  transmit  partial  messages: 
for  example,  the  “update”  call  forces  all  information  encoded 
by  Di  to  be  transported  to  Dd  so  that  the  display  will 
change.  It  may  also  be  important  to  transmit  quickly  com¬ 
mands  that  alter  the  handling  of  input  techniques. 

Summary:  We  have  seen  that  developing  the  ARPA  NGP 
was  primarily  a  matter  of  designing  device-independent  graphics 
package.  As  the  package  modules  are  designed,  we  find  that 
some  are  device-independent  and  others  depend  on  the  details 
of  particular  hardware.  Thus  a  natural  division  of  labor  be¬ 
tween  two  computers  is  to  execute  device-independent  mod- 
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Fig.  S.  The  structure  of  processes  using  the  network  voice  protocol.  Processing  of  the  protocol  is  divided  into  three  parts: 
sending  speech  data,  receiving  speech  data,  and  controlling  the  transmission.  Portions  enclosed  by  dashed  lines  are  'Wo-  .  .% 

coder  dependent.” 


ules  and  the  application  program  on  one  machine,  and  place 
the  device-dependent  modules  and  display  hardware  on  an¬ 
other.  A  network  provides  communication  between  the 
device-independent  and  device-dependent  modules  in  a  straight¬ 
forward  way.  (If  we  were  willing  to  standardize  the  procedure 
interface  between  the  application  program  and  the  device¬ 
independent  module,  this  interface  too  could  become  a 
point  of  separation  by  a  network.)  Thus,  methodology,  de¬ 
sign,  and  documentation  of  the  NGP  are  those  of  a  graphics 
package,  rather  than  of  an  ab  initio  protocol  design.  In  this 
setting,  the  network  becomes  a  relatively  unimportant  artifact 
of  the  implementation:  it  offers  interprocess  communication 
among  modules  of  a  software  system. 

D  A  Network  Voice  Protocol  (NVP) 

The  objective  of  an  NVP  is  to  control  the  transmission  of  a 
digital  representation  of  voice  signals  through  a  packet- 
switched  network  in  real  time.  The  transmission  may  be  part 
of  a  man-to-man  or  man-to-machine- communication.  A  pro¬ 
tocol  can  provide  the  effect  of  two-party  or  conference  calls, 
including  user  interfaces  such  as  dialing  and  ringing.  Or  it  can 
provide  a  voice  message  service  by  communicating  from  man 
to  a  computer  capable  of  first  recording  the  voice  data,  and 
and  later  responding  to  a  request  to  repeat  the  data  to  the  in¬ 
tended  recipients.  A  voice  protocol  differs  from  a  standard 
telephone  connection  in  that  applications  may  use  the  network 
to  transmit  additional  digital  information  along  with  the 
voice  data. 

A  voice  protocol  must  provide  natural  communication  be¬ 
tween  humans.  On  the  one  hand,  this  objective  seems  to  re¬ 
quire  extremely  good  performance.  Poor  reproduction  of  the 
speech  signal  may  prevent  recognition  of  the  speaker;  long 
transmission  delays  may  impede  a  dialogue;  small  amounts  of 
distortion  may  be  keenly  perceived  and  interfere  with  natural 
communication.  On  the  other  hand,  a  human  can  tolerate 
many  errors.  Occasional  noise  is  compensated  for  by  the  re¬ 
dundancy  in  human  speech;  catastrophic  garbling  is  repaired 
by  diaiogue-the  utterance  is  repeated. 

A  voice  protocol  should  try  to  provide  good  speech  quality 
even  though  data  transmission  rates  may  be  severely  limited 
by  the  network.  Fortunately,  techniques  are  available  for  en¬ 
coding  voice  as  low-bandwidth  digital  data;  these  techniques 
are  called  vocoding  techniques.  Vocoding  uses  real-time  signal- 
processing  algorithms  to  encode  and  decode  the  voice.  Vocod¬ 
ing  algorithms  tend  to  require  substantial  computing  to  achieve 
both  acceptable  reproduction  quality  and  low  data  bandwidth. 
High  compression  rates  also  tend  to  increase  sensitivity  to 
acoustic  noises  and  communication  errors.  Speech  quality, 
data  rate,  and  processing  all  interact:  achieving  higher  quality 
or  less  processing  requires  more  transmission  bandwidth.  One 
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of  the  motivations  for  a  device-independent  NVP  is  to  simplify 
experimentation  with  different  compromises. 

The  needs  of  a  voice  HLP  differ  from  those  of  a  graphics 
HLP  in  several  ways.  The  dominant  concern  in  an  NVP  is  the  ,q' 
vocoding  scheme  used,  often  because  special  fast  processing  .  ■ 
is  required  to  vocode  in  real  time.  Language  issues  are  rela- 
tively  minor.  A  voice  protocol  requires  a  real-time  transport  V- 
system:  delays,  and  especially  delay  distributions,  are  critical  •  ^ 
to  offering  real-time  service;  errors,  however,  can  be  tolerated.  . 
By  contrast,  the  NGP  can  tolerate  longer  delays  and  greater  '.-J 
variation  in  delay  for  “interactive”  service,  but  cannot  tolerate 
transmission  errors. 

An  experiment  in  voice  protocols  has  been  undertaken  by  *] 
four  sites  in  the  ARPA  network  [4J,  [5] .  Sites  with  differing 
hardware  for  implementing  network  control  and  vocoding  al- 
gorithms  were  deliberately  chosen  in  order  to  develop  a  pro- 
tocol  to  accommodate  such  differences.  ' 

Language:  The  language  for  the  NVP  has  three  functions.  ® 
First,  it  must  cope  with  different  sending  and  receiving  hard¬ 
ware  by  achieving  device  independence  in  some  way.  Second, 
it  must  support  user  interfaces  such  as  ringing  a  telephone, 
waiting  for  an  answer,  and  hanging  up.  Third,  it  must  control 
the  transmission  of  a  voice  signal  in  real  time.  The  protocol 
separates  these  functions  into  two  distinct  parts:  a  control 
protocol  and  a  data  protocol.  These  are  separate  protocols 
that  link  separate  processes  (see  Fig.  5).  The  data  processes  , 
are  concerned  only  with  transmitting  speech  data;  the  control 
processes  provide  the  other  two  functions  *  the  language.  V 

Some  of  the  NVP  language  is  devoted  to  controlling  the  con- 
versation  and  to  exchanging  information  about  its  status.  For 
example,  the  caller  process  tnes  to  initiate  a  call  with  “Calling  Pi 
(who)  from  (whom),”  where  each  field  addresses  one  of  several  ' 
telephone  lines  connecting  to  the  NVP  equipment.  The 
answerer  may  reply  “Ready”  or  “Goodbye,  I  am  busy.”  If  all 
goes  well,  the  answerer  may  say  “Ringing,”  and  eventually 
“Ready”  if  the  call  is  answered.  These  exchanges  are  simply 
designed  to  allow  caller  and  answerer  to  know  each  other's 
state:  ringing,  ready,  terminated.  (See  Fig.  6  fora  summary  of  ^ 
the  important  states  of  the  NVP,  and  Table  II  for  a  summary 
of  the  protocol.) 

A  more  interesting  part  of  the  control  protocol  is  the  nego-  ^ 
tiation  of  various  parameters  of  the  coding  process.  Although 
the  parameters  are  mentioned  below,  we  shall  illustrate  the 
negotiation  process  here.  One  of  the  two  processes  (caller  or  ^ 
answerer)  is  the  negotiation  master.  This  process  makes  pro-  7 
posals  for  parameter  values  that  the  other  may  accept  or  reject. 

A  proposal  takes  the  form  “Can  you  do  (what)  (how)'’"  The 
(what)  and  (how)  fields  are  coded  from  a  small  set  ((how)  is  ac- 
tually  a  list  of  ways  of  doing  (what)  that  are  acceptable  to  the 
master).  For  example,  the  master  asks  “Can  you  Jo  ^speech 
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Fig.  6.  The  stale  diagram  for  the  network  voice  protocol.  Large  circles 
indicate  states.  Transitions  between  states  occur  when  a  message  is 
transmitted  (arc  with  bos)  or  received  (arc  with  oval).  Note  that 
the  caller  and  cailee  state  diagrams  are  nearly  symmetric. 


signal  sampling  every)  ’62  microseconds)  or  <100  microsec¬ 
onds)?”  The  other  process,  the  negotiation  slave,  may  respond 
in  one  of  two  ways.  A  positive  response  is  “!  will  do  (speech 
signal  sampling  every)  <62  microseconds).”  A  negative  reply 
says  “I  can’t  do  (speech  signal  sampling  every)  in  any  way  you 
suggested,”  but  may  optionally  contain  a  suggested  <how)  that 
is  acceptable.  Although  at  any  instant  one  of  the  processes  is 
the  master  and  one  the  slave,  either  may  request  renegotiation 
to  exchange  roles. 

An  additional  version  provision  makes  the  negotiation  more 
convenient.  If  a  large  number  of  parameters  must  be  agreed 
upon,  negotiating  each  one  is  tedious.  We  define  a  version  to 
be  a  table  of  (<what),  <how>)  pairs  that  gives  settings  for  a  large 
number  of  parameters.  Then  the  single  negotiation  “Can  you 
do  (version)  <2 >?”  will  establish  many  parameter  settings. 


This  negotiation  scheme  is  designed  to  be  extremely  robust.' 
Messages  lost  or  delivered  out  of  sequence  will  not  garble  the 
negotiation  because  each  response  repeats  the  question  ((what)) 
as  well  as  providing  an  answer. 

Coding:  Strictly  speaking,  the  voice  protocol  is  concerned 
only  with  controlling  transmission  of  voice  data,  and  not  with 
the  techniques  used  to  generate  the  data.  Nevertheless,  good 
vocoding  techniques,  providing  acceptable  quality  speech  with 
modest  transmission  b-nd  widths,  are  the  key  to  an  NVP.  A 
vocoder  may  be  presented  with  16  000  10-bit  samples  of 
speech  waveform  each  second,  and  be  required  to  encode  this 
information  in  as  few  as  3500  bits.  Clearly,  such  encodings 
cannot  preserve  all  of  the  information  in  the  original  signal, 
bpt  must  strive  to  make  errors  that  have  only  small  perceptual 
effects. 
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TABLE  I! 

Summary  of  ARPA  Network  Voice  Protocol  Messages. 
Details  of  the  Protocol  Are  Given  in  [4] 


Control  mess4p.es: 

caJImg  ‘who-*  <«*ho*>  <'espome-ltnk> 
goodbye  <code» 

ner.olnlton.inquiry  <wn*l»  <li«t. length*  <how  '  sf> 

positive  r*#poti»tion.resoons*  <wh*l>  <ho*» 

rtep>liv*  ne^otiahod. response  <wh*t»  <try.th»*.v«..*» 

ready 

not  ready 

inquiry 

rinp.nfl 

echo  request  <id» 

echo  <id> 

fencf.Qtiation.request  <master  requett> 
renecot'aliorvapprovat  <master.request>  <ok> 


_.e samples  ot  <whjt>  is  negotiated: 

‘vocoding* 

<sample  period* 

<version» 

<ma«  ms*  length* 

«IPC  degree* 

<CVSO  time. constant* 

‘samples  oer  parcel* 

<L PC. acoustic  coding* 

<L  PC.  m  to.  coding* 

‘CPC  pre  emphasis* 

‘LPC.t  ables  sel* 

Data  messages: 

<time  stamp*  < skipped  parcels. flag*  <oarcel.ccunt>  ‘parcel*  _ 


Although  a  great  many  signal  coding  techniques  are  available 
[12],  the  ARPA  NVP  is  currently  experimenting  with  two: 
continuously  variable  slope  delta  modulation  (CVSD)  and 
linear  predictive  coding  (LPC)  (23].  These  two  techniques 
are  very  different;  CVSD  aims  to  produce  an  output  waveform 
that  looks  close  to  the  original,  whereas  LPC  tries  only  to  pro¬ 
duce  an  output  signal  that  sounds  like  the  original.  Experience 
gathered  in  recent  years  has  made  LPC  the  more  popular  tech¬ 
nique  for  low  data-rate  vocoding. 

Both  vocoding  schemes  operate  within  a  similar  framework. 
An  interval  of  speech  is  sampled  and  encoded  by  calculating 
the  values  of  a  number  of  parameters.  These  values  are  then 
packed  into  parcels  for  transmission  through  the  network. 
The  negotiation  protocol  is  used  to  establish  the  type  of  vo¬ 
coding  used  (LPC  or  CVSD),  the  sampling  rate,  the  length  of 
the  period  encoded  in  each  parcel,  and  other  details  of  the 
coding. 

CVSD  vocoding  requires  only  modest  processing,  but  does 
not  achieve  very  good  compression.  Typically,  62  ms  of 
speech  is  encoded  into  about  1000  1-bit  parameters  that  are 
packed  into  a  parcel  for  transmission,  thus  requiring  about 
1 6-kbits/s  transmission  rates.  Some  additional  reduction  in 
data  rate  comes  from  detecting  inactivity  of  the  speaker  due 
to  breathing,  intersentence  pauses,  or  listening  to  the  other 
party.  These  periods  of  silence  can  be  coded  with  very  few 
bits;  both  CVSD  and  LPC  coding  take  advantage  of  this 
reduction. 

LPC  employs  two  coding  steps  to  reduce  the  volume  of 
speech  data.  First,  an  acoustic  vocoding  is  applied  to  a  short 
period  of  sampled  speech  signal,  and  yields  about  a  dozen  pa¬ 
rameters  describing  the  signal  Second,  an  information  coding 
step  converts  numeric  values  of  these  parameters  to  a  smaller 
range  of  values  and  packs  them  into  a  parcel  for  transmission. 
The  parcel  is  the  unit  of  information  flow,  containing  about 
60  bits  generated  every  19,2  ms.  Several  parcels  are  packed 
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together  (up  to  about  14)  into  messages  for  transport  through 
the  network.  Thus  only  3500  bits/ s  are  generated.  This  num¬ 
ber  can  be  reduced  even  further  by  use  of  “variable  rate  LPC” 
that  takes  advantage  of  acoustic  redundancies  (e.g.,  long 
vowels)  to  reduce  the  information  content  of  the  parameters. 

The  LPC  acoustic  vocoding  algorithm  fits  a  model  of  the 
human  vocal  system  to  a  19.2-ms  interval  of  speech  and  de¬ 
livers  parameters  describing  the  fit.  The  inverse  process  is 
applied  to  regenerate  the  speech,  but  it  does  not  yield  a  signal 
identical  to  the  original.  The  result  is  expected  to  sound  like 
the  original  even  though  it  does  not  look  like  the  original. 

The  information  coding  step  is  simple  compared  to  the  LPC 
calculations.  Although  the  LPC  parameters  vary  over  a  wide 
range,  the  full  precision  need  not  be  transmitted.  The  range  of 
the  parameters  is  broken  into  32,  64,  or  128  intervals  with 
roughly  equal  probability  of  occurrence;  parameter  values  are 
mapped  via  tables  into  either  5,  6,  or  7  bits  for  transmission. 
The  receiver  inverts  this  nonlinear  transformation. 

Transport:  The  NVP  requires  real-time  transport  of  parcels. 
Unfortunately,  none  of  the  standard  transport  facilities  de¬ 
signed  for  HLP  use  in  the  ARPA  network  [  1  ] ,  [6]  could  pro¬ 
vide  such  service;  the  NVP  designed  its  own  real-time  protocol 
using  only  the  facilities  of  the  IMP-based  communication 
subnet. 

The  real-time  protocol  is  very  simple  and  is  applicable  to 
real-time  uses  other  than  speech.  Parcels,  generated  at  a  con¬ 
stant  rate,  are  collected  into  messages.  Each  message  is  time- 
stamped  with  the  time  of  creation  of  the  first  parcel  in  the 
message  (the  units  of  time  measurement  correspond  to  the 
time  interval  represented  by  a  single  parcel).  During  periods 
of  silence,  parcels  are  simply  not  generated,  although  any 
partially  filled  message  will  be  transmitted  promptly  as  the 
silence  begins.  (The  treatment  of  silence  is  in  fact  somewhat 
tricky;  see  [3].)  The  time  stamps  allow  the  receiver  to  re¬ 
sequence  messages  that  arrive  out  of  order  and  to  meter  the 
reproduction  of  speech  from  the  parcels. 

Irregularities  in  message  delays  in  the  network  cause  trouble 
[  1 3 1 .  Suppose  the  first  message  experiences  340-ms  delay  and 
the  receiver  begins  processing  the  first  parcel  of  this  message  as 
it  is  received,  [f  the  next  message  experiences  360-ms  delay, 
the  receiver  will  finish  the  last  parcel  of  the  first  message  20  ms 
before  the  next  message  arrives.  What  is  it  to  do?  The  re¬ 
ceiver  could  have  delayed  processing  the  first  message  some¬ 
what  in  order  to  allow  more  leeway  against  subsequent  longer 
network  delays.  This  leeway  can  be  allowed  to  grow  as  longer 
delays  occur.  However,  if  the  variation  in  the  network  delay  is 
high,  the  receiver’s  delay  may  become  so  long  that  an  inter¬ 
ruption  by  the  listener  is  delayed  awkwardly  or  that  the 
speaker  notices  a  pause  when  he  stops  speaking  and  begins 
listening.  Low  variation  in  network  delay  is  a  clear  require¬ 
ment  for  an  NVP. 

Very  few  conventional  transport  services  are  required  by  an 
NVP.  No  retransmission  to  remedy  errors  is  required  because  a 
lost  message  is  not  catastrophic;  both  coding  schemes  are  de¬ 
signed  so  that  a  parcel  is  decoded  without  knowledge  of  previ¬ 
ous  messages.  Moreover,  retransmission  would  introduce  highly 
variable  delays  that  cause  worse  perceptual  damage  than  the 
loss  of  the  message.  The  only  effect  of  a  lost  message  is  an 
audible  error  for  the  period  represented  by  the  missing  data. 
Because  of  the  real-time  nature  of  the  NVP,  there  is  also  no 
need  for  user-enforced  flow  control  sender  and  receiver  ire 
processing  messages  at  identical  rates. 

Results.  The  ARPA  NVP  was  first  used  successfully  in 
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December  1974  for  real-time  voice  communication  through 
the  ARPA  network  from  coast  to  coast  It  has  been  imple¬ 
mented  at  five  sites  and  has  been  used  regularly  since  then.  It 
has  been  used  experimentally  with  LPC  vocoding  at  rates  be¬ 
tween  1 .8  and  5  kbit/s,  and  with  CV SD  at  rates  between  8  and 
18  kbits/s.  .Although  only  a  few  sites  have  participated  in  the 
ARPA  NVP,  dissimilarities  among  the  hardware  and  software 
environments  have  thotoughly  exercised  the  device  indepen¬ 
dence  aims  of  the  protocol.  Some  of  the  initial  implementa¬ 
tions  used  one-of-a-kind  experimental  devices  such  as  the 
Lincoln  Laboratory  TX-2.  A  typical  site  uses  a  minicomputer 
attached  to  the  network  and  to  a  fast  arithmetic  unit  used  for 
vocoding  algorithms  (e.g.,  PDP-1 1/45  and  FPS  AP-1  20A). 

The  NVP  has  been  extended  to  handle  the  “higher  level” 
functions  of  storing  voice  on  files  and  of  conferencing.  In  both 
cases,  the  data  and  negotiation  protocols  remained  unaltered 
while  extensions  were  made  to  the  control  protocol  [5].  In 
conferencing,  for  example,  control  messages  are  used  to  estab¬ 
lish  which  speaker  “has  the  floor,”  and  to  whom  speech 
should  be  sent.  Additional  extensions  to  transmit  voice  to  a 
speech  recognition  system  or  from  a  speech  synthesis  system 
are  feasible  because  LPC  is  the  basis  for  some  approaches  to 
these  goals  [27],  All  of  these  extensions  are  accommodated 
with  only  minor  changes  to  the  protocol  because  the  basic 
NVP  framework  and  vocoding  required  no  alterations. 

Although  the  voice  application  seems  to  be  an  unrepresenta¬ 
tive  HLP,  we  have  seen  that  it  offers  a  new  perspective  on 
several  HLP  functions.  It  shows  a  simple  elegant  negotiation 
mechanism.  It  shows  the  effort  that  must  sometimes  be  de¬ 
voted  to  data  compression.  It  demonstrates  a  real-time  end- 
to-end  protocol.  And  most  importantly,  it  places  an  unusual 
set  of  requirements  on  the  transport  system:  low  delay  vari¬ 
ation,  minimal  flow  control,  and  no  need  for  extreme  re¬ 
liability.  Diverse  needs  such  as  these  require  the  transport 
system  to  be  flexible. 

III.  Designing  and  Implementing  HLP’s 

This  section  assembles  the  various  techniques  used  to  design 
and  implement  HLP’s  by  referring  liberally  to  the  three  illus¬ 
trative  protocols  of  Section  III,  and  also  to  other  HLP’s.  This 
compendium  is  a  prelude  to  an  examination  of  the  relationship 
between  HLP’s  and  HLL’s.  Our  observations  are  grouped  into 
the  categories  of  language,  coding,  and  transport. 

A.  Language 

The  language  for  an  HLP  is  clearly  the  most  important  part 
of  the  design.  Meeting  device  independence  and  standardization 
objectives  is  a  major  challenge;  it  is  difficult  to  abstract  the 
common  properties  of  a  range  of  denies  and  applications  in  a 
design  that  has  a  simple  form.  It  is  often  difficult  to  avoid 
letting  quirks  of  inflexible  operating  systems  or  of  special 
hardware  completely  destroy  an  otherwise  simple  protocol. 
The  network  is  not  an  extrinsic  contributor  to  the  problem: 
it  adds  a  few  new  difficulties  to  those  encountered  in  design¬ 
ing  a  nonnetwork  system  involving  multiple  asynchronous 
communicating  processes.  Nevertheless,  the  presence  of  the 
network  makes  communication  along  such  diverse  computer 
hardware  and  devices  possible;  thus  it  is  the  reason  for  much 
of  the  pressure  for  device  independence  and  standardization. 

Device  fndependence  Inherent  in  device-independent  de¬ 
signs  is  a  tension  between  generality  and  standardization.  As 
more  capabilities  of  devices  or  a  wider  range  of  applications 
are  encompassed  by  the  design,  standardization  becomes  more 


difficult.  The  NGP  designers,  for  example,  wanted  to  allow 
dynamically  moving  images  to  be  displayed,  provided  that  the 
display  hardware  made  some  provision  for  applying  geometrical 
transformations  to  the  display  file  rapidly  enough  to  see  mo¬ 
tion.  The  attempt  failed  because  the  designers  could  devise  no 
way  to  standardize  the  range  of  transformation  facilities  pro¬ 
vided  by  display  manufacturers.  However,  if  aspirations  of  de¬ 
vice  independence  were  reduced,  and  the  NGP  were  required 
to  operate  only  displays  of  a  certain  type,  dynamic  images 
could  be  accommodated  easily.  Thus  an  attempt  at  generality 
was  curbed  by  standardization  problems. 

Generality  can  also  lead  to  complexity.  Overly  general  de¬ 
signs  are  difficult  to  implement,  may  take  a  long  time  to  devise, 
and  often  require  more  computing  resources  to  process.  One 
way  to  reduce  this  problem  is  to  divide  the  facilities  of  an  HLP 
into  quasi-independent  groups  of  functions  with  different  de¬ 
grees  of  generality.  Any  particular  implementation  may  offer 
only  a  subset  of  the  groups.  The  NGP,  for  example,  recog¬ 
nizes  four  groups  of  commands  (status  and  inquiry,  segmented 
display  files,  positioned  text,  input  interactions).  Entire  groups 
are  optional-facilities  for  input  interactions,  for  example,  need 
not  be  provided.  Within  a  group  are  both  mandatory  and  op¬ 
tional  commands.  If  the  options  are  not  used,  a  very  simple 
implementation  results.  If  implementers  need  the  added  fea¬ 
tures,  the  NGP  specifies  a  standard  way  to  provide  them. 

At  the  other  extreme  are  protocols  that  are  not  very  general, 
but  are  extremely  simple.  These  unambitious  HLP’s  can  offer 
substantial  service.  A  file  transfer  protocol  that  deals  only 
with  text  files,  or  a  graphics  protocol  that  is  little  more  am¬ 
bitious  than  our  plotter  example,  is  remarkably  useful.  A  de¬ 
sign  that  provides  a  special  feature  in  a  device-independent 
way  is  worthless  if  it  is  never  used.' 

HLP’s,  whether  simple  or  general,  usually  provide  some  mea¬ 
sure  of  device  independence.  Two  approaches  are  used  to  deal 
with  different  devices. 

1 )  The  “virtual  device  approach.”  A  standard  is  designed 
that  specifies  the  operations  that  a  process  must  implement. 
The  plotter  protocol  is  an  example.  The  plotter  process  may 
be  able  to  forward  some  protocol  commands  to  the  plotter 
hardware  directly;  others  it  may  have  to  “simulate"  in  some 
way.  For  example,  if  the  standard  protocol  specified  colored 
lines,  the  plotter  process  might  replace  them  by  dashed  lines  if 
no  colored  pens  ire  available.  This  approach  derives  its  name 
from  the  similarity  with  “virtual  memory:”  a  computer  that 
provides  a  million  words  of  virtual  memory  need  not  have  that 
much  primary  memory,  but  a  collection  of  mapping  and  paging 
processes  simulate  the  effect  of  a  million  words  of  primary 
memory. 

2)  The  “parametric  approach.”  In  this  approach,  the  process 
preparing  to  send  protocol  commands  will  inquire  of  the 
plotter  process  information  about  its  capabilities’  How  many 
colors  can  be  plotted?  What  is  the  size  of  the  plotter  surface? 
What  resolution'1  Given  the  answers  to  these  questions,  the 
sending  process  can  arrange  to  use  only  features  of  the  plotter 
that  are  available.  Although  this  technique  still  requires  stan¬ 
dard  ways  to  specify  all  possible  plotter  commands  and  standard 
ways  to  inquire,  the  burden  of  simulating  in  software  those 
capabilities  lacking  in  the  plotter  is  shifted  to  the  sender. 

One  benefit  of  the  parametric  approach  is  that  the  applica¬ 
tion  program  can  be  advised  of  the  plotter's  properties  and  can 
take  advantage  of  the  information.  For  example,  if  color  is  not 
available,  the  application  program  can  simulate  :t  with  line 
textures  that  do  not  conflict  with  textures  already  used  by  the 
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program.  The  technique  also  allows  coding  to  be  parametrized: 
for  example,  the  plotter  process  could  specify  how  many  digits 
it  requires  for  each  coordinate  value. 

An  important  generalization  of  the  parametric  approach, 
called  negotiation,  makes  the  inquiry  activity  symmetric.  Each 
process  may  inquire  about  properties  of  the  other,  often  by 
asserting  its  request  for  a  certain  behavior:  sender  says  “Can 
you  do  four  colors?"  and  plotter  replies  either  “Yes,  IT1  do 
four  colors”  or  “No,  1  cannot  do  four  colors."  This  technique 
allows  each  process  to  adapt  to  the  other- they  may  end  up 
being  each  parameterized  according  to  some  of  the  other’s 
wishes.  Once  again,  the  method  for  negotiating,  and  the 
eventual  commands  that  flow,  must  be  standardized. 

The  basic  difference  between  these  two  approaches  concerns 
how  the  burden  of  achieving  device  independence  is  divided 
between  the  two  communicating  processes.  The  parametric 
approach,  used  by  the  NGP,  NVP,  and  TELNET  [ 7  ] ,  essentially 
delays  the  specification  of  certain  parameters  until  the  two 
communicating  parties  are  connected. 

The  methods  used  to  achieve  device  independence  can  pro¬ 
foundly  influence  the  structure  of  application  programs  that 
use  the  HLP.  For  example,  the  plotter  protocol  and  the  NGP 
are  both  graphics  protocols,  but  a  program  must  use  them  very 
differently,  because  they  use  fundamentally  different  tech¬ 
niques  to  make  changes  to  the  display.  An  applica,  -m  written 
to  use  a  segmented  display  file  could  not  be  adapted  easily  to 
use  the  plotter  protocol.  Moreover,  the  parametric  approach 
to  standardization  may  induce  the  application  to  operate  dif¬ 
ferently  for  different  parameter  values.  For  example,  an  inter¬ 
active  graphics  application  may  drive  a  storage  tube  very  differ¬ 
ently  than  a  refresh  display  (which  reflects  display  changes 
more  rapidly),  even  though  the  program  uses  the  same 
device-independent  methods  to  draw  lines  and  text  or  to  re¬ 
ceive  input. 

Combining  Protocols.  An  application  may  use  several  sepa¬ 
rate  HLP's  to  connect  its  constituent  parts.  The  different  pro¬ 
tocol  uses  are  often  independent,  and  use  separate  logical 
communication  paths.  But  sometimes  HLP's  can  unwittingly 
compete  for  resources.  A  good  example  is  illustrated  in  Fig. 
7(a),  in  which  a  user  employs  a  TELNET  protocol  to  control  a 
graphics  program,  which  in  turn  uses  a  graphics  protocol  con¬ 
nection  to  control  a  display.  Most  users  prefer  to  see  text  and 
graphical  output  presented  on  the  same  screen  (Fig.  7(b))- 
TELNET  and  the  NGP  thus  compete  for  screen  space.  They 
also  compete  for  keyboard  input.  But  neither  protocol  is 
aware  of  the  other’s  needs.  The  NGP  design  group  considered 
allowing  the  NGP  to  control  the  placement  of  TELNET  text 
on  the  screen,  but  abandoned  the  idea  when  it  became  clear 
such  control  would  require  synchronizing  the  display  process 
(Dd)  and  the  TELNET  process  (Tu)  whenever  screen  place¬ 
ment  changed.  This  problem  was  attacked  seriously  in  a  sub¬ 
sequent  graphics  protocol  designed  to  allow  careful  control  of 
“windows"  on  the  screen,  some  of  which  contain  TELNET 
text,  and  some  graphical  information  (371,(351. 

Robustness:  An  HLP  provides  a  vttai  communication  link  in 
an  application  distributed  over  several  computers,  a  link  whose 
failure  may  be  costly.  If  a  person  spends  an  hour  using  an 
interactive  Jata-analysis  program  that  uses  the  NGP  to  display 
results,  and  a  communication  error  causes  so  much  confusion 
that  the  program  aborts,  a  great  deal  of  work  is  lost.  Providing 
the  reliability  to  prevent  these  failures  requires  attention  at  all 
levels.  The  application  program  can  save  a  user's  state  peri¬ 
odically  in  case  of  catastrophe,  network  transport  facilities 
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Fig.  7.  Logical  connections  for  graphics  and  terminal  protocol*,  (a)  In¬ 
dependent  connection*,  with  the  display  and  terminal  physically 
separate,  (b)  Independent  connections,  but  “terminal  text'*  is  pre¬ 
sented  on  the  display,  (c)  A  single  connection  for  both  graphic*  and 
terminaj  protocols,  showing  the  insertion  of  “uninvited  text”  by  an 
operating  system. 

can  provide  some  measure  of  error  detection  and  correction 
(1],  (2),  [18]  and,  finally,  the  HLP  language  can  take  steps 
to  improve  reliability,  such  as  tolerating  lost  or  duplicated 
messages.  Redundant  coding  can  be  used  to  help  detect  errors. 
To  avoid  loss  of  synchronization  in  a  stream  of  commands  and 
arguments,  a  common  technique  is  to  occasionally  identify 
the  beginning  of  a  command,  for  example,  by  marking  a  mes¬ 
sage  to  indicate  that  a  command  appears  as  the  first  item  m 
the  message. 

Specification :  Implementations  of  HLP  languages  usually 
depend  on  the  proper  interpretation  of  a  rather  detailed  speci¬ 
fication.  The  detail  results  chiefly  from  the  requirement  that 
ail  implementations  of  the  HLP  communicate  with  an  identical 
language,  rather  than  from  complexity  in  the  HLP  itself.  In¬ 
deed,  the  designer  of  an  HLP  seeks  a  simple  elegant  solution 
that  offers  the  necessary  services  and  has  a  straightforward 
implementation. 

A  precise  HLP  specification  is  difficult  to  write.  Although  it 
is  similar  to  a  specification  for  a  “software  interface”  routinely 
used  to  detail  the  design  of  j  module  of  a  software  system,  it 
differs  in  two  respects.  First,  it  must  be  understood  by  HLP 
implemented  accustomed  to  different  programming  languages, 
operating  systems,  conventions,  and  nomenclature.  Second, 
because  the  impl ementer  may  not  be  familiar  with  the  applica¬ 
tion  addressed  by  the  HLP,  the  specification  must  provide  con¬ 
siderable  implementation  guidance.  We  shall  argue  in  Section 
IV  that  notions  from  HLL’s  can  be  used  to  improve  Ml  p 
specifications. 

Layering  Layering  helps  to  avoid  HLP  c on  e'e ' . ■ 

HLP  can  be  implemented  on  top  it  another  >  • 

ing  is  exploited  by  all  HLP’s,  the;,  use  e 

level  transport  protocols  to  proi.de  1 
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HLP’s  can  also  be  layered  atop  other  HLP’s.  Protocols  for 
conferencing  and  voice  recording  use  the  NVP  as  a  base.  An 
HLP  that  uses  a  coding  based  on  character  strings  may  be 
implemented  on  top  of  an  HLP  such  as  TELNET  that  trans¬ 
mits  character  streams  between  different  computers. 

Sometimes  the  opportunity  to  layer  one  HLP  on  top  of  an¬ 
other  is  impeded  by  poor  protocol  design  or  a  shortsighted 
implementation.  The  NGP,  for  example,  might  have  been 
layered  on  top  of  TELNET,  so  that  the  two  connections  of 
Fig.  7(a)  could  be  reduced  to  one  (Fig.  7(c)).  This  simplifies 
the  implementation  of  Dd  and  Tu,  requiring  only  one  process. 
It  also  simplifies  implementation  of  careful  control  of  terminal 
text,  as  the  single  connection  avoids  the  need  to  synchronize 
the  display  parameters  in  Dd  to  the  arrival  of  text  from  Tu. 
However,  the  single  connection  approach  was  rejected  by  the 
ARPA  NGP  as  too  error-prone.  Because  an  operating  system 
may  unexpectedly  generate  text  to  be  printed  on  the  terminal 
(e.g.,  “System  going  down.”),  it  may  interfere  with  graphical 
information  encoded  as  text  on  the  Ts-Tu  connection.  Gearly, 
the  TELNET  protocol  and  operating  systems  could  provide  a 
means  for  transmitting  encoded  information  that  would  not 
be  subject  to  disturbance.  Such  a  facility,  which  would  allow 
an  HLP  to  be  implemented  within  the  TELNET  protocol,  was 
not  available  when  the  NGP  was  designed. 

B.  Coding 

Coding  can  be  a  simple  matter  or  a  very  complex  one  in¬ 
deed.  It  is  simple  if  we  need  only  transmit  integers  or  symbols 
from  a  well-specified  vocabulary  (e.g.,  an  integer  that  identifies 
a  command).  Straightforward  codings  are  easy  to  encode  and 
decode;  the  NGP  and  NVP  control  messages  are  examples  of 
simple  codings.  NVP  data  messages  are  simple  encodings  of 
parameters,  although  the  parameters  are  extracted  from  speech 
samples  by  a  complex  vocoding  algorithm.  Although  greater 
investment  in  processing  can  be  used  to  compress  the  coding, 
no  HLP’s  known  to  the  authors  use  sophisticated  minimum- 
entropy  information  coding  techniques.  This  may  be  a  result 
of  greater  attention  to  the  less  well  understood  problems  of 
language  and  transport  than  to  coding  efficiency.  Moreover, 
large  efficiency  gains  can  often  be  achieved  by  appropriate  de¬ 
sign  of  the  protocol  language:  the  segmented  display  file  con¬ 
cept  in  the  NGP  often  a  greater  reduction  in  data  transmission 
than  would  minimum  entropy  coding  of  entire  images. 

Coding  problems  arise  when  we  try  to  encode  objects  that 
have  different  interpretations  in  different  computers:  “floating¬ 
point  numbers”  or  “text  characters”  are  examples.  What  hap¬ 
pens  if  a  “character”  has  no  representation  in  the  character  set 
of  the  computer  to  which  it  must  be  transmitted?  What  happens 
when  a  floating-point  number  lies  outside  the  range  of  num¬ 
bers  representable  on  the  destination  computer?  Although  it 
is  possible  to  characterize  precisely  the  acceptable  ranges,  it  is 
extremely  difficult  to  design  a  protocol  that  works  properly 
when  a  sender  wishes  to  transmit  an  object  that  cannot  be  in¬ 
terpreted  by  the  receiver.  Generally,  these  problems  are 
handled  by  guaranteeing  interpretations  only  on  restricted 
subsets,  such  as  “ASCII  standard  characters,”  or  even  “upper 
case  characters.” 

C.  Transport 

An  important  lesson  about  transport  systems  has  emerged 
from  work  on  HLP’s.  No  single  abstract  view  of  the  transport 
facility  applies  to  all  HLP’s;  different  protocols  need  access  to 


transport  facilities  of  different  types.  Transport  facilities 
should  be  designed  in  a  modular  way  to  provide  different 
types  of  service  to  different  HLP’s.  The  graphics  and  voice 
protocols  described  in  this  paper  illustrate  the  range  of  trans¬ 
port  services  required.  The  NGP  depends  on  orderly  trans¬ 
mission  of  streams  of  bytes,  and  consequently  requires  many 
supporting  transport  mechanisms  offered  by  1PC  protocols 
IH,  12],  [37],  [38]:  connections,  flow  control,  sorting  out- 
of-order  packets,  duplicate  packet  detection,  and  loss  preven¬ 
tion.  Small  sacrifices  in  bandwidth  or  irregularities  in  trans¬ 
mission  delay  are  unimportant.  By  contrast,  the  NVP  requires 
a  stable  transmission  delay,  but  can  tolerate  errors,  lost  packets, 
or  packets  discarded  due  to  local  network  congestion. 

Other  HLP’s  may  fall  at  points  in  between.  For  example, 
some  do  not  need  to  establish  network  “connections,”  but  can 
be  designed  around  inquiry-response  exchanges,  sometimes 
called  DATAGRAMS  [26],  A  protocol  to  look  up  telephone 
numbers  might  simply  address  a  single  message  containing  a 
person’s  name  to  the  retrieval  system;  it  expects  a  single 
answering  message  in  response.  If  the  sender  receives  no 
answer,  he  can  repeat  the  request.  A  surprisingly  large  variety 
of  protocols  can  be  cast  in  this  framework. 

The  grain  of  communication  required  of  the  transport  sys¬ 
tem  also  varies  among  HLP’s.  Some  will  want  to  transmit 
streams  of  bits  or  bytes;  others  may  compose  indivisible  mes¬ 
sages  to  be  delivered  as  atomic  entities.  In  implementing  a 
byte  stream  in  a  packet  switching  network,  the  transport  sys¬ 
tem  will  attempt  to  buffer  bytes  until  a  message  fills  up.  But 
how  often  should  packets  be  sent?  If  a  full  message  is  as¬ 
sembled,  presumably  it  should  be  sent  immediately,  subject 
only  to  flow  control  practices  in  the  network.  But  what  if  a 
message  is  only  partially  filled,  and  has  for  several  seconds 
seen  no  data  added  to  it?  Should  it  be  sent,  even  though  only 
partially  full?  Perhaps  the  receiving  process  is  idle,  and  could 
begin  working  on  the  new  data.  Or  perhaps  the  protocol  is  be¬ 
ing  used  as  part  of  an  interactive  dialog,  and  the  user  needs  to 
see  the  effect  of  the  data  contained  in  the  partial  message.  Or 
perhaps  the  message  is  partially  full  only  because  it  is  the  last 
message  of  a  long  transmission  (e.g.,  when  transmitting  an 
entire  file). 

The  question  used  to  resolve  these  questions  is:  on  what  unit 
of  information  can  the  receiving  process  take  significant  ac¬ 
tion?  If  we  want  the  receiver  to  act  each  time  a  new  byte  is 
generated  by  the  sender,  we  must  ensure  that  no  encoded  in¬ 
formation  is  buffered  at  the  sender  for  very  long,  i.e.,  that  par¬ 
tially  filled  messages  are  transmitted  at  short  intervals.  If,  on 
the  other  hand,  we  are  concerned  only  with  finishing  a  long 
transmission  as  quickly  as  possible,  using  full-size  packets  for 
efficiency,  we  can  expect  the  sender  to  indicate  that  trans¬ 
mission  of  the  very  last  (partially  filled)  packet  is  required 
even  though  it’s  not  full. 

The  transmission  strategy  used  to  make  these  decisions  is 
determined  in  part  by  application  process.  Two  facilities 
should  be  provided. 

1)  The  sending  process  states  a  time  interval  that  is  the 
maximum  time  any  chunk  of  information  is  buffered  before 
sending  (assuming  network  flow  control  conventions  allow 
transmission).  This  time  may  be  very  long  if  timely  trans¬ 
mission  is  unimportant,  as  in  a  file  transfer,  or  it  may  be  re¬ 
lated  to  the  amount  of  data  accumulated  for  transmission  [3] . 
(Note  that  such  a  scheme  is  an  essential  aspect  of  “terminal 
output” -characters  must  be  periodically  transmitted  for  the 
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user  to  see.  Unfortunately,  many  operating  systems  that  pro¬ 
vide  interfaces  to  network  transport  mechanisms  provide  this 
timely  transmission  facility  only  for  terminal  output;  they  do 
not  implement  the  technique  for  other  HLP  network  traffic.) 

2)  The  sending  process  notifies  the  transport  mechanism 
when  it  is  important  that  buffered  information  actually  be 
transmitted. 

The  needs  of  different  HLP's  are  sufficiently  varied  to  induce 
different  behaviors  in  the  way  transport  is  provided:  messages 
with  time  criticality  of  100  ms  can  be  treated  differently  than 
messages  with  1-min  delivery  requirements.  An  HLP  should 
make  known  to  the  transport  system  the  type  of  service  it  de¬ 
sires.  For  example,  NVP  communications  will  request  uniform 
delay.  Interactive  streams  desire  short  delays,  modest  band- 
widths,  and  high  reliability.  Transmitting  a  file  requires  high 
bandwidth,  but  can  tolerate  long  transmission  delays.  In  this 
case,  packets  could  be  routed  far  afield,  taking  advantage  of  idle 
communications  lines,  incurring  long  delays  and  numerous 
hops,  but  still  provide  large  bandwidth  because  many  packets 
are  taking  different  long  routes  concurrently. 

Most  of  the  issues  described  in  this  section  concern  the  im¬ 
plementation  of  transport  facilities  in  an  operating  system  and 
not  the  design  of  a  communications  network.  As  a  conse¬ 
quence  of  layering,  important  issues  in  network  design  (nam¬ 
ing,  addressing,  message  sizes,  internetwork  operation,  routing) 
rarely  influence  HLP  design.  An  exception  is  our  plea  that  the 
communication  system  respond  to  “type  of  service”  requests 
from  applications. 

In  summary,  we  believe  1)  no  single  transport  facility  is  uni¬ 
versal,  2)  transport  systems  that  are  assembling  messages  from 
smaller  items  need  to  transmit  partial  messages  in  response  to 
time  constraints  and  precise  instructions  from  the  sender,  and 
3)  the  transport  needs  of  the  application  should  be  specified 
to  the  packet-switching  network  in  order  to  provid  die  type 
of  service  desired. 

D.  Implementation 

For  HLP’s  to  be  useful  in  a  resource-sharing  network,  imple¬ 
mentations  must  be  devised  for  many  of  the  computers  in  the 
network.  The  ease  with  which  these  implementations  can  be 
constructed  is  often  a  major  goal  of  an  HLP  design.  Although 
most  HLP’s  could  be  implemented  easily,  programming  en¬ 
vironments  of  many  computers  unfortunately  complicate 
implementations.  Many  operating  systems  incorporate  net¬ 
work  access  as  an  afterthought  and  therefore  provide  cumber¬ 
some  facilities.  Many  have  poor  support  for  cooperating 
processes,  which  are  used  frequently  in  HLP  implementations. 
And  finally,  tools  for  debugging  asynchronous  processes  are 
usually  poor.  In  a  network,  the  problem  is  exacerbated  by  the 
inability  to  examine  the  state  of  the  cooperating  process  or  to 
repeat  the  conditions  that  caused  an  error.  Even  stopping  the 
program  being  tested  may  be  difficult:  the  other  process  may 
detect  prolonged  inactivity  and  abort! 

Implementation  and  debugging  can  be  aided  by  measures 
taken  in  the  design  of  the  HLP.  Commands  to  reset  both 
processes  to  an  initial  state  are  essential;  facilities  to  report 
errors  precisely  can  be  provided.  Sometimes  explicit  de¬ 
bugging  aids  can  be  built  into  an  HLP  implementation -a 
version  of  the  NGP  display  process  that  types  out  a  trace  of  all 
protocol  commands  received  and  sent  greatly  speeds  debugging. 
Layering  of  protocols  also  eases  implementation.  If  an  HLP  is 
implemented  on  top  of  a  transport  system  deemed  to  transmit 


streams  of  data  reliably,  many  of  the  possibilities  for  timing- 
related  errors  are  removed  from  the  HLP. 

IV.  High-Level  Protocols  and  High-Level 
Languages 

The  finale  of  this  survey  of  issues  surrounding  the  design  and 
implementation  of  HLP’s  relates  the  structure  of  HLP’s  to  that 
of  HLL’s,  and  suggests  techniques  used  in  HLL’s  that  can  be 
applied  fruitfully  to  problems  in  HLP’s.  In  addition,  HLP’s 
point  to  some  areas  that  should  receive  increased  attention  in 
HLL  development. 

A.  Why  an  HLP  Protocol  Is  Like  An  HLL 

HLL’s  and  HLP’s  are  both  abstract,  but  precise,  definitions 
of  a  computation  or  communication  that  can  be  translated 
into  the  primitive  operations  actually  provided  by  a  computer 
or  communication  system.  HLL’s  were  bom  of  a  desire  to 
express  algorithms  in  a  language  natural  to  the  application;  a 
compiler  fills  in  many  details  as  it  translates  the  program  into 
the  machine  language  of  a  particular  computer.  Similarly,  an 
HLP  is  most  clearly  expressed  in  a  “language”  that  relates 
closely  to  the  application;  the  tedious  details  of  coding  and 
transport  are  not  the  primary  concerns  of  the  HLP  designer 
and  implemented 

One  of  the  powerful  ideas  in  HLL’s  is  the  concept  of  type:  a 
method  for  defining  abstract  objects  such  as  integers,  charac¬ 
ters,  arrays,  lists,  text  strings,  records,  sets,  etc.  A  compiler, 
together  with  type  definitions,  chooses  a  way  to  represent 
these  objects  within  the  physical  memory  structure  of  the 
computer.  With  this  aid,  the  programmer  is  freed  from  the 
details  of  encoding  various  objects  in  memory,  and  concen¬ 
trates  on  harnessing  these  objects  to  ease  the  construction  of 
his  program. 

The  concept  of  type  is  just  as  prevalent  in  HLP’s.  Usually  a 
message  is  divided  into  “fields,”  sequences  of  bits  that  are 
assigned  particular  meaning.  But  to  the  HLP  designer,  these 
fields  are  simply  encodings  of  types  used  in  the  application 
program:  integers,  characters,  etc. 

Data  structure  declarations  thus  play  a  key  role  in  both 
settings.  They  structure  memory  for  the  HLL;  they  structure 
messages  for  the  HLP.  A  concrete  example  from  the  plotter 
protocol  will  further  illustrate  this  point.  The  protocol  is  a  se¬ 
quence  of  commands  (a  type);  each  command  specifies  a 
function  (a  type)  and  possibly  some  arguments  (of  various 
types).  Using  PASCAL  [171,  we  would  write: 

type  PlotterFunction  *  (fBeginPlot.fEndPlot.fUpTDown, 
fLineTo); 

type  PlotterCommand  *  record 
case  function:  PlotterFunction  of 
fBeginPlot:  () ; 
fEndPlot:  ()  ; 
fUp:  0  ; 
fDown:  ()  ; 
fLineTo:  (x,y  :  int) 
end\ 

type  PlotterProtocol  =  file  of  PlotterCommand 

PlotterFunction  is  a  type  that  can  take  on  five  values  corre¬ 
sponding  to  the  names  fBeginPlot,  fEndPlot,  fUp,  fDown,  and 
fLineTo.  The  key  type  is  PlotterCommand,  which  is  a  variant 
record  with  an  entry  giving  the  function  and  possible 
arguments-it  thus  specifies  the  structure  of  a  single  command 
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in  the  plotter  HLP.  The  type  PlotterProtocol  is  a  sequence  of 
such  commands. 

The  application  program  might  generate  the  plotter  HLP  by 
calling  five  procedures  corresponding  to  the  five  commands. 
For  example,  the  procedure  LineTo  builds  a  PlotterCommand 
record  consisting  of  the  function  name  (fLineTo)  and  two 
argument  values.  The  record  is  then  “transmitted”  by  ap¬ 
pending  it  to  the  PlotterProtocol  sequence: 

var  c  :  PlotterCommand; 

var  pp  :  PlotterProtocol; 

procedure  LineTo  (tx,  ty  :  int ); 
begin 

c.function  :=  fLineTo;  (•  Build  command  •) 
c.x  :=  tx;  (•  Insert  arguments  •) 
c.y  :=*  ty  ; 

ppt  :=  c;  put(pp);  (•  Add  command  to  sequence  •) 
end 

Notice  how  type  declarations  and  record-construction  state¬ 
ments  help  specify  coding  and  transport  details. 

The  paradigm  illustrated  above  is  a  common  one  in  HLP’s: 
transmitting  records  consisting  of  a  command,  followed  by 
values  of  arguments.  It  is  no  coincidence  that  these  records 
correspond  to  the  specifications  of  typed  objects  in  a  pro¬ 
cedure  call;  the  declaration  of  the  procedure  LineTo  and  of 
the  fLineTo  variant  of  the  PlotterCommand  record  are  strik¬ 
ingly  similar.  The  plotter  protocol  and  the  NGP,  which  use 
this  paradigm,  are  essentially  transmitting  procedure  names 
and  arguments  to  the  receiving  process,  which  calls  correspond¬ 
ing  procedures.  Both  protocols  could  be  expressed  in  terms  of 
a  “procedure  call  protocol,”  an  HLP  that  arranges  to  achieve 
this  effect  [  39]  .*  HLP’s  with  this  flavor  arise  quite  frequently 
when  a  single  application  program  consisting  of  several  mod¬ 
ules  is  distributed  in  a  network.  It  is  precisely  this  methodology 
that  we  advocated  for  developing  the  NGP. 

Not  all  HLP  designs  are  conveniently  expressed  within  a 
framework  as  specialized  as  a  procedure  call  protocol.  The 
NVP,  for  example,  does  not  fit  well  into  this  category.  How¬ 
ever,  the  concept  of  structuring  communication  into  records 
by  using  appropriate  type  declarations  remains  extremely 
valuable.  These  type  declarations  must  of  course,  be  shared 
by  the  sender  and  receiver  to  insure  both  parties  apply  the 
same  interpretations  to  messages. 

Although  the  idea  of  type  is  perhaps  the  most  important, 
other  aspects  of  HLL’s  correspond  to  HLP  features.  When  two 
modules  are  linked  together,  communication  methods  are 
established  between  the  modules  (usually  through  procedure 
linkages),  much  the  way  a  connection  establishes  a  communi¬ 
cation  between  two  processes.  The  act  of  binding  the  two 
modules  together  might  be  compared  to  the  negotiation  or 
inquiry  used  to  inaugurate  communication,  though  most  HLL 
binding  is  considerably  less  flexible  than  HLP  parametrization. 
The  data  abstraction  techniques  in  HLL’s  foster  methods  for 
designing  software  systems  that  are  similar  to  the  abstractions 
required  to  achieve  device  independence  or  to  define  standards. 
Finally,  HLL’s  are  increasingly  concerned  with  methods  for. 

’  Strictly  (peaking,  s  "procedure  call"  requires  that  control  not  return 
to  the  caller  until  the  procedure  ha  been  executed.  In  the  plotter  and 
NGP  protocols,  it  is  desirable  to  let  the  calling  process  continue  execu¬ 
tion  so  that  several  "procedure  call”  records  can  be  packed  together 
into  one  network  message.  If  the  procedure  being  called  returns  an 
answer,  the  calling  process  mutt  of  course  wait  for  the  answer. 
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specifying  precisely  the  interface  to  software  modules,  in  part 
so  that  uses  of  the  module  by  other  programs  can  be  checked 
carefully.  This  problem  of  precise  description  is  also  key  to 
HLP's;  an  accurate  specification  of  the  HLP  for  a  service  must 
be  available  to  any  programmer  writing  programs  that  use 
the  HLP. 

B.  How  HLL  i  Can  Help  HLP's 

Our  sketch  of  the  relationship  between  HLL’s  and  HLP’s 
shows  that  many  ideas  developed  for  HLL’s  address  correspoiid- 
ing  needs  in  HLP’s.  However,  to  be  fully  useful  in  implement¬ 
ing  HLP’s,  some  of  these  ideas  must  be  developed  further. 

1)  Type  definitions:  Currently,  HLL’s  use  type  definition 
facilities  that  are  too  loose  and  dependent  on  details  of  the 
computer  on  which  they  execute.  For  example,  an  “integer” 
will  designate  36  bits  on  some  machines,  32  on  others,  16  on 
still  otheis,  etc.  This  definition  is  too  vague  to  be  useful  in  a 
heterogeneous  network,  i.e.,  one  containing  computers  of 
many  different  sorts.  Instead,  we  need  to  define  integers  pre¬ 
cisely,  e.g.,  “24-bit  2’s  complement  integer.”  Also,  most  struc¬ 
tural  definition  facilities  will  not  allow  us  to  split  a  field  over  a 
word  boundary  in  the  computer.  Such  a  facility  may  be 
needed  when  interpreting  a  standard  protocol,  even  though  it 
might  be  folly  within  any  given  machine.  Thus  programming- 
language  techniques  for  defining  structure  on  word-oriented 
storage  should  be  extended  and  applied  to  defining  structure 
in  bit-oriented  protocol  streams. 

2!  Interprocess  communication  ( IPC ]:  Many  newer  HLL’s 
are  beginning  to  offer  facilities  for  IPC,  some  using  the  idea  of 
transmitting  typed  records  as  we  showed  above  [20).  How¬ 
ever,  none  provides  automatically  the  vital  link  between  the 
variant  record  and  the  procedure  call.  The  programmer  can 
write  the  obvious  encoder,  as  demonstrated  in  the  PASCAL 
example,  and  a  matching  decoder  that  collects  arguments  and 
dispatches  to  call  the  proper  procedure,  but  these  are  bulky 
and  tedious  to  write.  This  sort  of  IPC  is  common  in  networks, 
but  is  neglected  in  programming  systems.  Some  efforts  have 
been  devoted  to  this  problem  [11],  [  16] ,  but  the  need  has  not 
been  widely  recognized. 

3)  Linking:  HLL’s  must  be  prepared  to  check  types  used  in 
IPC  when  connections  are  made.  Ideally,  a  process  offering  an 
HLP  service  could  distribute  (in  a  standard  form!)  a  definition 
of  the  protocol  to  receive  service.  Such  a  definition  could  be 
checked  against  the  protocol  definition  of  a  potential  partner 
when  a  connection  is  initiated— this  is  similar  to  checking 
caller/callee  parameter  declarations  when  linking  separately 
compiled  programs;  we  insist  that  definitions  match  before 
we  allow  communication. 

4)  Asynchrony:  We  need  better  ways  to  document  message 
sequences  and  message  exchanges.  As  in  any  multiprocessing 
system,  we  must  worry  about  deadlock  and  synchronization 
issues.  They  may  be  especially  tricky  if  we  use  a  transport 
mechanism  that  sometimes  delivers  messages  out  of  sequence 
or  occasionally  loses  them.  Assertions  provided  with  the  pro¬ 
gram,  and  flow  analysis  undertaken  by  a  compiler,  can  help 
with  certain  communications  problems:  can  a  particular  mes¬ 
sage  be  packed  with  others  to  form  a  larger  package  to  trans¬ 
port  through  the  network,  or  must  it  be  sent  immediately  be¬ 
cause  a  reply  is  required?  Efforts  in  this  area  are  beginning 
18],  [34], 

Sj  Documentation:  There  is  also  a  need  for  a  definition  at  a 
more  functional  level:  What  are  the  services  of  the  protocol? 
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What  effect  does  each  command  have?  How  do  the  commands 
interact?  What  are  possible  responses  to  particular  messages? 
It  seems  clear  that  a  computer  program,  perhaps  expressed  as 
abstract  operations  of  a  process  that  implements  the  protocol, 
is  a  better  documentation  tool  than  is  prose.  The  data  abstrac¬ 
tion  languages  now  emerging  [IS],  [21],  [30],  are  working 
toward  such  definitions.  Even  using  existing  HLL’s,  a  simple 
exemplary  implementation  of  the  NGP  might  be  a  much  better 
definition  of  its  properties  than  is  a  bulky  document. 

Ideally,  these  goals  can  be  addressed  in  an  integrated  fashion. 
A  new  project  of  Feldman,  Low, and  Rovner  [9] ,  [  10] ,  tackles 
some  of  them  with  the  programming  language  PLITS,  which 
integrates  messages  and  message  structures  into  an  HLL  and  a 
distributed  environment  DSYS,  which  provides  the  binding 
and  debugging  facilities  needed  to  develop  applications  that 
use  several  communicating  processes. 

Developments  that  truly  integrate  protocols  into  program¬ 
ming  languages  and  tackle  interprocess  communication  among 
processes  written  in  different  programming  languages  would 
open  up  many  new  opportunities  for  users  of  HLP’s.  Programs 
using  the  protocols  would  be  easier  to  implement  and  easier  to 
change.  The  protocols  themselves  could  be  changed  more 
readily,  and  could  be  described  by  precise  definitions  as  well  as 
lucid  implementations. 


V.  Conclusion 

The  emergence  of  resource-sharing  networks  and  the  HLP’s 
that  exploit  them  has  aroused  public  interest  in  standards. 
Although  a  standard  protocol  is  substantially  harder  to  design 
than  is  a  set  of  private  communication  conventions,  the  reward 
is  enormous:  the  ability  to  communicate  effectively  with  a 
vast  number  of  computer  resources.  The  desire  to  generalize 
an  HLP-to  increase  its  scope  to  handle  as  many  uses  as 
possible- is  countered  by  increased  difficulty  in  achieving  a 
standard.  The  search  for  simple  effective  standard  HLP's  is 
still  in  its  infancy-we  can  expect  that  HLP  development  will 
continue,  and  can  hope  that  further  principles  and  methodol¬ 
ogies  will  emerge  to  help  design  these  protocols. 

Although  they  have  received  increased  attention  because  of 
the  growth  of  computer  networks,  problems  of  standardization 
and  generalization  are  not  intrinsic  to  networks.  In  fact,  the 
presence  of  a  network  often  confuses  the  job  of  designing  a 
framework  to  provide  service  to  an  application  with  the  job  of 
transmitting  information  in  a  network.  Breaking  the  problem 
into  issues  of  language,  coding,  and  transport  is  a  help.  More 
importantly,  the  application  must  be  understood  well  in  simple 
environments,  free  from  standardization  constraints,  before  a 
standard  protocol  design  is  attempted. 

The  outlook  for  streamlining  specification  and  implementa¬ 
tion  of  HLP’s  is  closely  tied  to  the  development  of  HLL’s. 
Existing  HLL’s  provide  facilities  that  can  be  applied  to  simplify 
these  tasks.  The  important  next  step  is  for  the  language  de¬ 
signer  to  realize  that  programs  written  in  his  language  will  need 
to  communicate  with  other  programs  written  in  other  lan¬ 
guages,  on  computers  with  different  conventions.  Protocol 
definitions  must  be  shared  among  these  environments  in  order 
to  propagate  standard  protocols.  Although  it  is  a  difficult  goal 
to  standardize  even  a  simple  form  of  interprocess  communica¬ 
tion  over  a  wide  variety  of  computing  equipment  and  pro¬ 
gramming  systems,  ignoring  this  opportunity  will  curb  the  po¬ 
tential  of  resource-sharing  networks.  Ultimately,  HLL’s  could 
be  enhanced  to  the  point  that  techniques  used  to  implement 


HLP’s  will  be  expressed  as  naturally  as  arithmetic  expressions 
are  now. 
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•a  prSSBIt  a  snort  tutorial  survey  of  algo- 
ntnms  for  locating  and  identifying  spatially  dis¬ 
tributed  sources  and  receivers.  Tbe  empoasis  is 
on  aetnoos  tnat  are  eitner  considered  to  be  very 
basic  or  lend  tnemselves  potentially  to  distri¬ 
buted  computations,  tna  main  objective  of  tnis 
work.  «e  snail  also  very  briefly  outline  our  own 
approacn  to  tnis  set  of  proolems. 

1.  Introduction 

m  many  practical  proolems  it  is  necessary  to 
determine  tne  location  of  signal  (noise)  sources 
from  ceasurements  provided  by  one  or  more  sensors. 
Typical  applications  include: 

Acoustic  surveillance  systems  (e.g.,  sonar 
detection  of  wow  flying  aircraft), 

ieismic  arrays  for  seismic  exploration,  moni¬ 
toring  earthquakes  and  nuclear  explosions,  or 
detecting  vehicle  movements, 

-  Antenna  arrays  for  radio  astronomy  or  elec¬ 
tronic  surveillance  (e.g.,  direction  find¬ 
ing)  . 

-  .eultlple  radar  systems  for  detection  and 
tracking. 

Tne  diversity  of  applications  involving  tne 
target  location  proolem  makes  a  general  unified 
treatment  of  tnis  suoject  quite  difficult.  To 
proviae  some  focus  for  our  discussion  we  will  use 
tne  fo  — owing  sample  proolem: 

Consider  a  small  numoer  of  sensor  sites 
ipernaps  ten)  dlstrlouted  over  a  specified 
area,  a  numoer  of  targets  are  present  in  tne 
area  and  tneir  location  is  to  be  estimated 
cased  on  tne  data  collected  by  the  sensors. 
The  sensors  measure  signals  wnish  are  eitner 
eaitted  oy  tne  target  (tne  passive  case)  or 
reflected  oy  it  (tne  active  case  which 
requires  target  illumination).  By  processing 
tne  signals  provided  by  tne  sensor,  informa¬ 
tion  aoout  target  bearing  ana/or  range  can  be 
determined. 

.Sometimes  a  single  sensor  is  not  capable  of 
measuring  eitner  range  or  searing,  as  for  example 
with  omnidirectional  passive  sensors.  Combining 
lata  from  a  group  or  array  of  sensors,  nowever, 
saxes  it  possible  to  find  tne  desired  information. 


A  sensor  array  of  tnis  type  may  be  located  at  a 
single  site,  in  which  case  we  consider  it  as  one 
unit,  or  it  may  be  distributed  among  many  sites. 

In  otner  cases  tne  sensor  sites  can  provide 
different  types  of  target  related  data,  in  partic¬ 
ular: 

(i)  Range  only  (ranging  radar,  active  sonar) 

(ii)  bearing  only  (optical,  infrared  sensor, 
direction  finder) 

(iii)  bearing  and  range  (searcn,  tracking  radar) 

(iv)  Target  velocity  (uoppler  radars,  MTI) 

Different  data  types  lead  to  different  location 
estimation  tecnniques.  For  example,  range  only  or 
bearing  only  measurements  are  related  to  target 
association  tecnniques  (section  2.2).  Bearing  ana 
range  data  is  usually  associated  with  tracking 
algor ltnma  for  moving  targets. 

Data  from  a  single  omnidirectional  passive  sensor 
is  treated  oy  tima-of-arrlval  metnods  (section 
2.1)  or  oeamforming  and  array  processing  tecn¬ 
niques  (section  2.4).  The  estimation  metnod  also 
depends  on  the  type  of  signals  provided  by  the 
sensor  site:  conerent/nonconerent,  "raw"  or  fil¬ 
tered  data  (linear  processing),  data  after  detec¬ 
tion  (nonlinear  processing),  etc. 

Classical  methods  of  processing  sensor  data 
nave  generally  been  of  tne  centralized  type,  that 
is,  all  of  the  sensor  data  was  collected  at  one 
site  and  tnen  processed.  An  alternative  is  to 
process  mucn  of  the  data  at  tbe  collection  site 
and  to  send  only  tne  relevant  data  to  eitner  a 
central  site  or  (more  generally)  to  the  appropri¬ 
ate  user,  in  Section  3  we  snail  discuss  tne  dif¬ 
ferent  types  of  distributed  processing  and  ttteir 
ad vantages. 

in  tne  last  section  we  will  describe  our  own 
approacn  to  tne  development  of  distributed  algo¬ 
rithms  for  tne  estimation  of  position,  location 
and  otner  cnaracteristics  of  sensors  and  sources, 
irtorf  et  al.i.  We  will  give  a  snort  description 
of  sample  algorithms  of  a  fully  distributed  nature 
tnat  nave  desirable  features,  we  anal,  also  out¬ 
line  several.  of  tne  nonclassical  approaches  to 
solving  these  proolems. 

2.D  Tacunigues  for  estimating  target  location 


..ns  worx  was  supported  oy  ARFA  under  contract  .iLA  This  section  provides  a  brief  summary  of  tne 
j0;-7 s-C-0’ Vv .  solution  tacnnlques  associated  witn  tne  target 
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.ocation  problem.  Oniy  the  oasis  ideas  are 
presented;  tne  details  can  oe  found  in  tne  refer¬ 
ences. 

2.’  Cime-of- Arrival  estimation 


Ala.  of  tne  information  aoout  tne  target  loca¬ 
tion  are  encoded  m  the  relative  tiae-oelays  of 
tne  various  sensors.  To  see  this  consider  tne 

following: 


A  signal  emanating  froa  a  reaote  source  and 

measured  in  the  presence  of  noise  at  two  spatially 
separated  sensors  can  oe  modeled  as 

x,  lt)»s(t)*n,  (t)  I’a) 

x2u)*itslt*iMin2(t) ,  (16) 

■mere  s(t),  n,(t),  n,(t)  are  assuaed  to  6e  sta¬ 
tionary,  Independent,  randoa  processes.  One  com¬ 
mon  metnou  of  estimating  tne  time  delay,  0,  is  to 

compute  tne  cross  correlation  function 

3x,x2*£(x,(t)Xj(t-'W  }  (2) 

.t  follows  directly  tnat 

n*,x2cr)=*<<3s('e<-o) 

wnere  H  ( • )  is  the  signal  autocorrelation  func¬ 
tion.  *fn  important  property  of  autocorrelation 
functions  is  tnat  ajs;r)lfijs(0) .  Thus,  the  peax 
of  a  /p-O)  win  occur  at  tsO.  This  provides  us 
witn  a  way  for  finding  tne  delay  oy  calculating 
tne  astiaatea  cross-correlation  function,  if  s(t) 
is  a  unite  noise  source,  R  (T'-0)*4(f-0),  and  tne 
peax  will  So  snarpiy  defined,  in  general,  Rja( * ) 
will  oe  "spread  out"  whxcn  tends  to  oroaden  the 
peax,  maxing  it  more  difficult  to  pinpoint  the 
actua-  delay,  furtaermore,  men  multiple  targets 
(ana  multiple  delays)  are  present,  tne  "tails"  of 
tne  autocorrelation  functions  for  different  tar¬ 
gets  will  de  overiayea  ar.d  more  difficult  to 
separate.  Thus,  it  is  deslranla  to  preprocess  tne 
sensor  measurements  x, ,  x2  so  that  after 
cros3correiation  sharper  peaxs  will  result,  as  in 
fig.  ’.  In  tne  aosence  of  measurement  noise  this 
can  oe  lone  oy  passing  x.(t),  x2(t)  tnrougn  a 

"wnitening"  filter  for  s(t),  nence  tne  correlation 
of  s  is  removed.  men  noise  is  present,  tne 
filter  nas  to  taxe  into  account  ootn  signal  and 
noise  spectra. 


multiple 

correlator 


Fig.  2:  Target-sensor  geometry 


wnere  a,.  is  the  relative  delay  Oetween  sensor  i 
and  sensor  J,  and  c  is  the  propagation  velocity. 

r^=(xT-*1)a*(yT-y1)2,l  *  It; 

it  can  Be  snown  locnaidtj  tnat  for  »2j,  tr.a  sat  of 
equations  («),  (5)  can  os  rewritten  as  a  linear  set 
of  equations  for  x.  y„,  wnere  tne  coefficients  are 
mown  quantities*’  tl.e.  written  m  terms  of 
°ij  *1  F •  This  set  of  equations  can  now  oe 
solved ’to  determine  tne  target  location. 

Tne  discussion  above  indicates  tnat  one  way 
of  solving  the  target  location  proolem  is  to  first 
estimate  tne  time-of-arrivai  delays  ana  tnen  to 
compute  tne  location  oased  on  tne  geometry  of  tne 
proolem  [dannj.  it  is  possible,  of  course,  to 
combine  these  two  steps  and  develop  an  estimator 
airect.y  for  tne  target  coordinates  (x*.  y„ ,  or,  as 
j is  more  commonly  done,  for  its  oesrlng’ana  range. 
This  leads  to  alternative  estimator  structures, 
typically  using  tne  maximum  ilxelinood  ipproasn 
[Bangs  ana  donultaeissj ,  [MacDonald j. 

2.2  Target  Association  Tscnniques 


rig.  A  time-daily  estimator. 

Different  cnolces  for  tae  pre-fiiter  are  pos- 
sioie  cepenaing  upon  tne  performance  criterion 
cnosen  oy  tne  designer:  tne  lixeiinooc  function 
,.iann  and  Tretter:,  tne  deflection  function  [Ar.app 
ir.d  iarter[,  etc.  .t  sncuid  se  noted  tnat  several 
estimator  structures  oeaides  tne  mu-ilplier- 
correiator  estimator  cave  oeen  ievelopeo. 


A  special  type  of  proolem  arises  wnen  multi¬ 
ple  sensors  wnicn  measure  range  out  not  annum 
(or  vice  versa;  are  used  to  estimate  target  loca¬ 
tion.  if  oniy  a  single  target  is  present,  its 
location  is  found  oy  multiiateraticn .  For  exam¬ 
ple,  if  amimutn  measurements  frees  several  sensors 
are  available,  one  nas  dniy  to  compute  tne  inter¬ 
section  of  tne  various  .ir.es-cf-signt . 


Tne  situation  becomes  more  complex  wnen  mul¬ 
tiple  targets  are  present.  This  is  illustrated  by 

rig. 


rig  j :  Ibe  target  association  prooiem 


iacn  sensor  is  assumed  to  have  detected  tne 
two  targets  T,  However,  tnese  detections  are 
not  properly  associated,  it  is  not  known  wnlon 
measurement  of  aacn  sensor  corresponds  to  wnion 
target.  Thus,  it  is  necessary  to  associate  tar¬ 
gets  wltu  sensor  measurements  before  estimating 
tne  target  locations  (in  fact  tne  association  and 
.ocation  proolaos  are  addressed  simultaneously) . 
Note  also  tnat  if  taere  are  more  targets  tnan  sen¬ 
sors.  amoiguities  ("gnost"  targets)  may  result, 
.n  rig.  j,  if  was  not  tnere,  tne  measurements 
3f  i,  and  would  oe  consistent  witn  toe  assump¬ 
tion  mat  tne  targets  are  at  g,  g2  ratner  tnan  at 

:v<- 

several  3uneoes  nave  been  proposed  to  solve 
tne  target  association  problem,  ana  tney  are 
briefly  aescrised  below. 

list  Forming 

Pick  a  pair  of  sensors  and  compute  all  tne  inter¬ 
sections  of  tneir  iines-of-signt  to  potential 
targets  (i.e.  directions  In  wnion  tney  detected 
sometnlng) .  Tnese  intersection  points  are  poten¬ 
tial  target  locations,  now  pick  a  tniro  sensor 
ana  cnecit  wnetner  its  iines-of-signt  pass  tnrougn 
any  of  tne  intersection  points.  if  not,  delete 
tnese  points  from  tne  list  of  potential  targets, 
by  proceeding  tnls  way  witn  tne  otner  sensors,  tne 
-ist  will  finally  include  only  tnose  target  loca¬ 
tions  wnion  are  consistent  with  ail  the  observa¬ 
tions.  it  snouid  oe  empnasizea  tnat  tnis  is  a 
mgniy  simplified  description  of  more  realistic 
list  forming  algorithms. 

back-projection  or  dpaoe-iearou 

(he  space  to  oe  searcned  is  divided  into  ceils  of 
a  size  corresponding  to  tne  system  resolution. 
Tne  numoer  ’  is  adaed  to  those  ceils  of  tne  space 
wnicn  lie  aiong  tne  .me  of  signt  of  a  given  sen¬ 
sor  detection.  This  process  is  repeated  for  ail 
.ines-of-signt  of  ail  sensors.  As  can  oe  seen 
from  Fig.  a  the  target  vocations  can  oe  identified 
as  tnose  caving  tne  nlgnest  numoer  l*  tne  numoer 


of  sensors)  written  in  tnem.  Note  tnat  relatively 
nign  values  can  oe  ootained  at  locations  otner 
tnan  tnose  of  tne  real  target  ("gno3t"  -ocationsj . 


\  \ 


So 


Fig.  e:  Association  Dy  oacx-projeotlon 


Image  reconstruction  techniques 

It  is  possible  to  view  toe  sensor  measurements  as 
line  Integrals  tnrougn  an  image  consisting  of 
"points  of  lignt"  at  tne  target  locations.  The 
line  Integrals  are  over  the  infrared  emissivity 
map  (for  IK  sensors)  or  tne  radar  reflectivity  map 
(for  ranging  radars).  The  prooiem  of  reconstruct¬ 
ing  images  from  tneir  line-integral  projections 
nas  been  extensively  treated  in  literature  te.g., 
Brooks  and  01  Chira,  horn],  hecently  it  was  snown 
how  these  techniques  can  be  applied  to  the  target 
association  problem  iFriedlander  at  ai.,  Oen ton  et 
ai.j  oy  reconstructing  tne  "brightness  map"  of  tr.a 
area  under  surveillance  and  identifying  targets  as 
tne  "brignt  spots",  it  snouid  be  noted  tnat  the 
Image  reconstruction  metnoa  requires  that  tne  sen¬ 
sors  provide  tne  actual  energy  measured  in  eacn 
direction  (range)  and  not  just  target/no  target 
Information.  The  detection  taxes  place  after  pro¬ 
cessing  tne  information  from  all  tne  sensors;  in 
list  forming  ana  spaoe-searcn,  only  tne  results  of 
tne  detection  performed  at  eacn  individual  sensor 
are  passed  on. 

k.o  spectral  estimation 

Jiuiti  pie-sensor  measurements  can  oe  con¬ 
sidered  as  sampies  of  a  time-space  function  yit.ij 
where  ?  represents  a  point  in  j-o  space.  Tne 
notions  of  (temporal)  correlation  function  ana 
(temporal)  speotrai  density  can  be  extended  to 
time-space  functions  of  tins  type.  «e  aefir.e  a 
random  fie.d  y(t,£;  as  stationary  ana  homogeneous 
if  iiyi  t,£; )sj  ana 


rUt,t'  ,£.,£•)  *&ty(t,£Jy(t'  ,£.')) 

=«(♦,£)  (b) 

wnere 

tst-c',  z* 


Any  nomogeneous  random  field  naa  a  spectral 
representation 

yU,j»e  £j,jj>rl(«<*'£,z(a*<tg),  (?) 

where  g*(x,  ,* ,,*-) ,  £*(x,y,z)  and  s(£n£|t)  la  a 
random  function  uitn  certain  properties.  The 
correlation  function  can  oe  represented  by 

rtCtr.r)* 

J  fj J’e'^0**’*  am  die  (d) 

where  ?  is  the  spatial-temporal  spectral  density. 
«s  in  tne  temporal  case,  an  inversion  formula 

colds : 

Ufj  -«e 

H(*,£)  atr  dt.  (9) 

is  an  important  example,  consider  a  mono¬ 
chromatic  (single  frequency)  plane  wave  at  tem¬ 
poral  frequency  ol  propagating  in  tne  direction 
given  oy  a  unit  vector  i  at  velocity  a.  ouch  a 
wave  is  represented  Oy 

y(t,£)*exp(jui)teg0'2J  (’Oa) 

wnere 


.t  is  easy  to  verify  that  for  tnis  space-time 
function  we  nave 

-J(aut*gQ-.£) 

«(♦,£)«  J  ("a) 

and 

t>(i»,.g)*£(i»-«»3,4-*0)  ("Oj 

wnicn  is  a  delta  function  located  at  temporal  fre¬ 
quency  m 3  ano  spatial  frequency  ,or  wave  numeer) 
<c3.  ibis  example  indicated  now  P(m,v)  provides 
information  regarding  tne  direction  (and  velocity) 
of  propagation  of  waves. 

The  point  of  tnis  discussion  is  that  (spa¬ 
tial)  spectral  estimation  is  a  way  of  estimating 
target  searing,  since  if  we  p.ot  f(m,g;  in  the  a 
plane  (for  a  fixed  m),  it  will  tenc  to  oe  concen¬ 
trated  around  tne  point  g  wnicn  corresponds  to  tne 
direction  of  tne  wave  propagation  ana  nence  tne 
searing  of  tne  target  (tne  source  of  tnese  waves). 


Thus,  target  searing  estimation  reduces  to  tne 
problem  of  estimating  P(i*,.g)  from  the  measurements 
y^iiDj),  wnere  r,  represents  sensor  locations  ana 
t  are  the  sampling  times  of  tne  output  of  tnat 
sensor. 

Many  spectral  estimation  teenniques  nave  been 
used  in  this  context;  several  are  described  in  tne 
references. 

2.4  seamforming  and  Array  Processing 

Pernaps  tne  moat  common  operation  in  processing 
signals  in  a  sensor  array  is  that  of  beamforming. 
Antenna  arrays  (for  radar,  communication,  etc.) 
and  acoustic  sensor  arrays  (sonar)  are  typical 
examples  of  beamforming,  aeamforalng  consists  of 
a  summation  of  time-delayed  (or  pnase-sniftec) 
versions  of  tne  sensor  outputs,  l.e. 

N 

Z(t)»  l  y(t-t .,£.),  (’2) 
1*1  1 

wnere  t^  represents  the  time-oeuay  for  sensor  1 
and  ?i  its  location.  Proper  cnoice  of  tne  celays 
t,  enhances  toe  signals  received  from  a  particular 
direction  and  attenuates  signals  from  otner  direc¬ 
tions.  This  operation  ‘ *  tn»  .patial  equivalent 
of  a  temporal  narrow  >  filter. 

The  output  Z(  t)  the  oeamformer  is  a 
(scaiar)  time  function,  wnich  is  processed  so  as 
to  obtain  a  measure  of  tne  signal  energy  in  an 
optimal  way.  The  most  common  processing  senemes 
include 

Ms toned  filtering 

Wiener  filtering  or  least-squares  estimation 

Maximum  livelihood  estimation 

This  linear  filtering  is  often  followed  oy  a 
nonlinear  operation,  e.g.  squaring  and  integra¬ 
tion.  Tne  order  of  linear  filtering  ana 
transforming  (also  a  linear  operation)  is  often 
reversed,  the  exact  structure  depenaing  on  tne 
application.  Toe  implementation  of  tnese 
processes  is  usually  done  in  tne  rourier  domain 
(with  pnase  shifts  replacing  time  deiays)  out  time 
domain  implementation#  are  also  usea.  *  sample  of 
the  vast  literature  on  beamforming  ana  tne  associ¬ 
ated  signal  processing  (refened  to  as  "array  pro¬ 
cessing")  is  given  in  tne  bibliography . 

A  ciass  of  array  processor  of  particualr 
interest  are  tne  different  types  of  adaptive 
arrays.  The  need  for  adaptive  arrays  arises  for 
many  reasons.  Some  examples: 


Null  steering,  to  minimize  interference  from 
sources  other  tnan  tne  target  of  interest. 

Adaptive  filtering,  to  candle  unxnown  noise 
and  signal  statistics. 

Adaptive  beamforming.  Seamformicg  requires 
precise  xnowledge  of  the  sensor  locations 
(witnm  fractions  of  a  waveisngthj  in  order 
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tnat  tne  3 tearing  delays  oe  computed.  rela¬ 
tively  small  errors  can  lead  to  serious  per¬ 
formance  degradation.  Thus,  unan  sensor 
locations  are  imprecisely  <cnown  or  are  con¬ 
stantly  changing,  a  fixed  processing  scneme 
is  infeasioie. 

•.  Ine  .ieea  for  Distriouted  Computation 

mere  are  many  advantages  in  districting 
computations  for  a  large  sensor  networx;  some  of 
tne  main  arguments  are  tne  following: 

v ’ )  Reduction  of  Computational  Complexity 

Distributee  processing  is  often  used  as  a 
means  for  solving  proolems  related  to  large-scale 
systems.  I'nis  approcn  leads  to  tne  decomposition 
of  a  nign-dimensionai  proolem  into  a  sequence  of 
soaner-aimensional  ones.  I'nis  often  results  in 
considerable  computational  savings;  many  fast 
digoritnms,  sucn  as  tne  Fast  Fourier  Transform, 
are  of  tnis  type.  Also,  certain  large-scale  prob¬ 
lems  simply  cannot  oe  solved  in  a  direct  manner 
(e.g.,  inversion  of  very  large  matrices)  and  ways 
nave  to  ae  found  to  decompose  tne  problem  into 
smal-er  parts  tnat  can  oe  nandied.  mis  can,  of 
course,  oe  cone  in  a  centralized  manner,  but  tne 
distributed  approach  often  leads  to  natural  decom¬ 
positions  and  valuable  insignts. 

neiiaoliity 

Distributed  systems  nave  good  properties 
from  a  reliability  standpoint  due  to  taeir 
innerent  parallelism.  Failure  of  a  computational 
module  does  not  necessarily  result  in  system 
failure  since  tne  computational  load  can  be  re¬ 
distributed  among  tne  remaining  modules.  Thus,  a 
distributed  system  may  nave  tne  ability  to  recon¬ 
figure  and  continue  operation,  depending  on  tne 
type  of  tne  system  and  its  structure,  its  opera¬ 
tion  after  reconfiguration  may  oe  at  a  reduced 
performance  level.  (Tins  would  Oe  tne  case  if  tne 
remaining  computational  resources  were  insuffi¬ 
cient  to  complete  tne  solution  of  tne  prooiem,  or 
if  tne  loss  of  a  computational  module  was  associ¬ 
ated  witn  tne  loss  of  a  sensor  site.)  Toe  system 
displays  graceful  degradation  of  performance, 
union  contrasts  witn  tne  "catastropnic"  failure 
mode  of  centralized  systems. 

lu)  fiexiollity 

The  distributed  nature  of  computations  is 
often  associated  with  distributed  system  struc¬ 
tures;  sucn  a  system  structure  aignt  Oe  a  collec¬ 
tion  of  sensor/cooputer/ communication  modules 
interconnected  in  a  networx.  This  organization 
leads  to  a  very  flexlole  structure,  possessing 
desiraoie  properties  wnicn  are  not  always  present 
in  a  centralized  system: 


iasy  system  groutn 

Tne  capcity  to  nano.e  topological  cnanges  m 
me  svstem  structure  !e.g.,  adding  or  delet¬ 


ing  nodes  during  maintainance  without  inter¬ 
rupting  system  operation) . 

-  The  possibility  of  incorporating  many  comoi- 
nations  of  resources,  witn  variao*e  perfor¬ 
mance  levels,  as  determined  ay  tne  neecs  of 
eacn  user. 

-  The  combination,  in  a  3ingie  networx,  of  many 
types  of  information  3ansors. 

**.  Dur  Approacnes 

Distributed  processing  has  oy  now  secome  a 
term  tnat  is  applied  to  many  types  of  systems  ana 
is  not  very  weil  defined,  since  tne  distnoutaa 
sensor  net  prooiem  can  ae  weal  descriaeu,  see  e.g. 
[ill  DON  Report;,  we  snail  use  it  as  a  oasis  for 
defining  cistributed  processing.  «e  consider 
three  computational  organizations  wnicn  cou*a  ae 
used  in  tnis  context:  centralized,  independently 
distriouted,  and  cooperatively  distributee. 

')  Centralized  --  all  sensor  cata  13  passed 
to  a  central  sita,  wnere  computation  is  per¬ 
formed,  and  tne  pertinent  resuits  are  men 
returned  to  tne  appropriate  remote  sites. 

2)  independent  --  all  sensor  information  is 
communicated  to  every  otner  site,  anc  eion 
site  then  maxes  its  oest  estimate  of  tne 
environment. 

3)  Cooperative  --  sites  exchange  processes 
information,  ana  at  most  partial  sensor  iata. 

It  is  tne  second  and  tnird  organizations  tnat  are 
normally  referred  to  as  distributed  organizations, 
and  cne  chira,  in  particular,  tnat  we  consider  to 
be  of  greatest  interest. 

The  dearen  for  Distributed  algorithms 


Centralized  algorithms  are  now  quite  we*l 
understood,  as  a  perusal  of  tee  extensive  litera¬ 
ture  indicates,  nowever,  very  few  attempts  nave 
oeen  made  to  unify  and  integrate  ail  tue3e  :if- 
ferent  approaches  ano  results.  A  typical  coox  on 
radar  or  sonar  signal  processing  is  a  ratner  ad 
hoc  collection  of  data,  metnods  ana  tneory  .remin¬ 
iscent  of  a  coox-ooox) .  .0  an  outsider  of  tnis 
field  it  is  extremely  difficult  to  get  a  coherent 
picture  and  to  maxe  intelligent  onoi ces  :n  apply¬ 
ing  tnese  metnods  in  tne  design  of  systems.  « 
systematic  representation  of  tnis  <r.ow*ege,  oy 
itself  a  tremendous  tasx,  is  required  in  orcer  to 
maxe  effective  use  of  tne  availaoie  alternatives. 
It  is  very  tempting  to  suggest  tne  csveiopment  of 
an  "expert  support  system"  ocaoming  and  extsr.dlng 
recent  approaches  in  Ai,  aata-oase  management  and 
reiatea  fields. 

cur  approacn  to  tne  development  of  .coopera¬ 
tively)  distriouted  algorithms  can  oe  summarises 
under  tne  following  headings: 

Partitioning  of  optima*  centra-imsc 
algorithms,  sucn  as  .taximum-i.i«.e.:ncoc, 
ixtenaec  i\a»man  filtering,  anc  team 
Forming.  This  aDproacr.  id  cse:**  -r.er. 
tr.e  system  under  consideration  car. 


divided  into  subsystems  with  sparse 
interaction. 


application  and  extension  of  metnoos 
developed  in  tne  context  of  aecentral- 
izea  control  and  estimation,  e.g.  team 
decision  tneory  and  differential  games, 
nierarcnical  and  multilevel  ay sterna, 
aggregation  aetnoda,  singular  perturba¬ 
tion  and  otner  perturbation  tecnniques, 
perioaic  coordination  and  spatial 
dynamic  programming  (adp). 

Development  of  new  optimal  distributed 
algorltma  for  specific  subsets  of  sensor 
data.  For  example, 

-  Time/Frequency  Difference  uf 
arrival  (TiluA/FOUA)  data,  using 
ARKA  modeling. 
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Detection  theory  problems  are  characterized 
by  the  fact  that  one  must  decide  which  of 
several  alternatives  are  true,  for  example,  if 
there  are  two  alternatives,  if  a  target  is  there 
or  not,  the  problem  is  a  binary  detection  prob¬ 
lem.  In  estimation  theory  problems,  which  very 
closely  parallel  those  of  detection  theory,  one 
must  estimate  parameters  of  a  signal  or  situa¬ 
tion.  For  example,  one  type  of  signal  estimator 
or  receiver  estimates  values  of  a  continuous 
parameter.  It  will  rarely  compute  the  exact 
signal,  but  attempts  to  be  close  most  of  the 
time. 


The  problems  in  which  detection  and  estima¬ 
tion  theory  are  utilized  can  be  categorized  into 
three  levels  or  a  hierarchy. as  shown  in  Figure 
1.  Level  1  problems  include  detecting  and  es¬ 
timating  Known  signals  in  noise;  level  2,  sig¬ 
nals  with  unknown  parameters  in  noise;  and  level 
3,  random  signals  in  noise.  The  particular 
problems  that  will  be  discussed  in  more  detail 
in  this  note  are  marked  with  an  asterisk. 


The  general  detectlon/estimatlon  problem  is 
more  clearly  explained  with  the  aid  of  Figure  2. 
The  real  situation  of  interest  Includes  the 
presence  or  absence  of  a  target;  its  charac¬ 
teristics,  e.g.,  its  reflectivity  or  noise  emis¬ 
sions;  and  its  parameters,  e.g.,  range,  azimuth, 
and  velocity.  Noise  such  as  ground  clutter, 
weather  formations,  or  background  acoustic  noise 
corrupts  the  target  information  as  it  propagates 
from  the  area  of  interest  to  the  sensor.  The 
sensor  transforms  the  information  presented  to 
it  into  measurements  or  points  in  observation 
space,  for  example,  power  levels  in  azimuth  or 
range  bins.  When  the  observation  spaoe  has  a 
finite  number  of  dimensions,  the  problem  is  a 
classical  detection  theory  problem.  The  detec¬ 
tion  statistic  calculator  processes  the  measured 
data  to  produce  appropriate  statistics  so  that 
target  detection  and  parameter  estimation  can 
best  be  carried  out.  The  results  are  the  deci¬ 
sion,  and  if  positive,  estimates  of  target 
metrics. 


For  illustrative  purposes,  consider  the 
simple,  idealized  example  in  which  the  presence 
of  a  target  is  indicated  by  a  d.c.  level  of  'd' 


volts;  the  larger  the  target,  the  greater  the 
voltage  level.  Absence  of  the  target  is  indi¬ 
cated  by  0  volts.  Zero-mean  Gaussian  noise  is 
added  to  the  voltage  level  and  the  sensor  takes 
samples  of  the  sum  of  the  signal  level  and  the 
noise.  The  observation  space  consists  of  the 
values  of  the  sampled  voltage.  A  possible 
detection  rule  is  to  compare  values  of  the  sam¬ 
pled  voltage  to  a  threshold,  i;.  If  the  sample 
value  is  greater  than  i;,  a  target  1s  declared  to 
be  present;  otherwise  the  ares  of  interest  is 
declared  to  be  empty.  The  two  fundamental  er¬ 
rors  that  result  in  this  example  and  in  detec¬ 
tion  theory  is  general  are  false  alarms,  declar¬ 
ing  that  a  target  is  present  when  it  is  not,  and 
false  dismissals,  deolarlsg  no  target  when  one 
is  present.  One  usually  wants  to  maximize  the 
probability  of  detection,  P_  ,  without  signifi¬ 
cantly  increasing  the  probability  of  a  false 
alarm,  P_.  For  this  simple  example,  PQ  is  plot¬ 
ted  against  P_  with  d  as  a  parameter  in  Figure 
3.  In  the  figure,  the  actual  threshold  used  in 
the  decision  rule  is  proportional  to  q.  The 
detection  probability  is  plotted  against  the 
voltage  level,  d,  with  P_  as  a  parameter  in 
Figure  4.  Both  figures  illustrate  that  there  is 
a  trade-off  between  the  two  major  errors.  If 
one  lowers  the  threshold  to  detect  smaller  tar¬ 
gets,  one  must  contend  with  a  higher  false  alarm 
rate.  Alternately,  if  one  increases  the  thres¬ 
hold  to  keep  the  false  alarm  rats  low,  one  must 
accept  a  lower  probability  of  detection. 


II.  J2S&  strawman  Small  AfiPlUtlC  Arrays 
1.  Introduction. 

The  DSN  strawman  acoustic  sensor,  a  small 
acoustic  array,  has  been  introduced  in  an  ear¬ 
lier  paper,  Lacoas,  "Tutorial  on  Sensor  Technol¬ 
ogy,"  (these  proceedings).  The  signal  process¬ 
ing  that  must  be  performed  to  calculate  ap¬ 
propriate  statistics  to  detect  and  estimate  tar¬ 
get  parameters— high  resolution  frequency- 
wavenumber  analysis— is  briefly  described  here. 
In  the  case  of  a  small  acoustic  array,  the  tar¬ 
get  metric  that  is  estimated  is  the  acoustic  az¬ 
imuth.  The  acoustic  azimuth  is  the  true  azimuth 
of  the  target  at  the  time  the  acoustic  signals 
being  processed  were  emitted  from  the  target. 
Because  the  propagation  velocity  of  sound  is  of 
the  same  order  of  magnitude  as  the  velocity  of 
the  target,  the  target  has  usually  moved  a  con¬ 
siderable  distance  by  the  time  the  acoustic  sig¬ 
nals  propagate  to  the  sensor.  The  statistic 
that  is  calculated  from  the  array  data  is  power 
as  a  function  of  frequency  and  wavenumber.  An 
explanation  of  how  this  result  esn  be  interpret¬ 
ed  for  target  detection  and  azimuth  estimation 
is  first  presented.  Next  the  high  resolution 
algorithm  is  described  and  its  signal  processing 
steps  are  enumerated.  Finally  some  detection 
and  estimation  problems  are  discussed. 


2.  Frequency-Wavenumber  Analysis. 

High  resolution  frequency  wavenumber 
analysis  performs  frequency-domain  beamforming 
on  the  array  data  with  the  additional  feature 
that  output  noise  is  minimized.  Conventional 
beamforming  and  the  resulting  target  detection 
and  estimation  of  acoustic  azimuth  is  illustrat¬ 
ed  by  considering  the  acoustic  sensor,  a  planar 
array  of  microphones.  A  plane  acoustic  wave  im¬ 
pinges  upon  the  array  along  an  elevation  angle  6 
from  the  vertical,  and  from  the  horizontal  az¬ 
imuth  9.  Although  the  wave  propagates  at  the 
velocity  of  the  medium,  v,  it  traverses  the  ar¬ 
ray  at  the  apparent  velocity,  C  *  v/sind.  The 
apparent  velocity  ranges  from  v  for  a  wave  pro¬ 
pagating  across  the  array  parallel  to  the  ground 
to  infinity  for  a  wave  propagating  onto  the  ar¬ 
ray  from  directly  overhead. 

The  analysis,  in  effect,  hypothesizes  that 
a  wave  at  a  particular  frequency,  f,  is  crossing 
the  array  from  a  horizontal  angle  9  at  an  ap¬ 
parent  velocity,  C.  The  position  of  each  of 
the  array  microphones  is  used  to  calculate  the 
time  delay  or  phase  that  the  hypothetical  wave 
would  have  at  each  microphone.  The  data  meas¬ 
ured  at  each  microphone  are  appropriately  de¬ 
layed,  summed  with  all  the  other  such  data  and 
integrated.  Coherent  energy  with  the  ' hy¬ 
pothesized  azimuth  and  phase  velocity  adds  con¬ 
structively,  whereas  energy  from  other  direc¬ 
tions  and  at  other  phase  velocities  do  not. 


positions.  At  each  frequency,  the  power  at  ap¬ 
proximately  300  to  1000  wavenumber  points  is 
calculated.  Both  azimuth  and  phase  velocity 
resolution  depend  on  frequency,  and  the  number 
of  wavenumber  points  that  are  calculated  vary 
with  the  selected  frequency. 

The  input  data  are  N  »  10  sampled  waveforms 
collected  from  the  M  microphones  in  the  sensor. 
During  each  two-second  analysis  Interval,  data 
is  recorded  continuously  for  1.024  seconds  at  a 
2  kHz  per  channel  rate.  The  2048  samples  from 
each  channel  are  broken  up  into  K  a  5  overlap¬ 
ping  blocks,  each  of  N  *  512  points.  Each  block 
is  multiplied  by  an  appropriate  window  and  a 
Fourier  transform  is  performed  on  each  block. 
Thus,  N-point  Fourier  transforms  are  per¬ 
formed,  resulting  in  K  estimates  of  the  matrix 
periodograms,  X  (f  ).  The  computations 
described  from  this  point  are  all  performed  for 
each  frequency.  An  estimate  of  the  power  spec¬ 
tral  density  covariance  matrix  for  frequency, 
f^,  is  calculated: 


cij  -  £xi(vx},(fk 


), 


where  *  denotes  complex  conjugate.  The  result¬ 
ing  matrix  is  then  normalized  to  equalize  power 
in  each  channel: 


Hij(V  * 
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The  resulting  (9,C)  space  is  parameterized 
by  9  and  f/C  ,  where  K  *  f/C  is  called  the 
wavenumber  of  the  wave.  Another  way  of  inter¬ 
preting  X  is  2»/\,  ,  where  X,  is  the  apparent 
wavelength  of  the  wave.  The  possible  range  of  K 
and  9  is  the  area  inside  a  circle  with  radius 
f/C  as  shown  in  Figure  5.  In  the  analysis,  this 
area  is  discretized  and  the  beam  power  at  each 
point  is  calculated.  If  a  wave  indeed  is  pass¬ 
ing  through  the  array  with  apparent  velocity  C* 
from  direction  9* ,  the  power  will  peak  at 
(9', f/C')  or  (9* ,K').  The  equal-power  contours 
of  a  peak  that  indicates  a  wave  propagating 
parallel  to  the  plane  of  the  array  are  shown  in 
the  figure. 

3.  The  High  Resolution  Algorithm. 

There  are  two  basic  steps  in  the  high  reso¬ 
lution  frequency-wavenumber  analysis.  First, 
the  acoustic  array  data  are  processed  to  compute 
estimates  of  the  power  spectral  density  covari¬ 
ance  matrices  for  all  the  frequencies  of  in¬ 
terest.  (In  the  DSN  strawman,  approximately  50 
frequencies  between  5  and  250  Hz  will  be  con¬ 
sidered.)  Second,  at  each  frequency,  a  spatial 
wavenumber  analysis  is  performed.  Wavenumber 
power  estimation  corresponds  to  a  discrete  two 
dimensional  spatial  Fourier  transform  with  spa¬ 
tial  sampling  points  determined  by  microphone 


All  the  elements  along  the  diagonal  of 
RCf^)  are  one. 

Several  matrix  modifications  are  done  before  the 
actual  estimates  of  power.  First  a  small  diago¬ 
nal  matrix  is  added  to  the  RCf^)  to  force  the 
matrices  to  be  non-singular: 

R(fk)  *  RCf^+e  I. 

Then,  the  complex  RCf^)  are  replaced  by  their 
inverses .  *' 

The  final  step  is  to  estimate  power  for 
each  wavenumber  of  Interest.  The  estimation  of 
power,  given  frequency  and  wavenumber,  reduces 
to1  calculating  a  Hermit lan  quadratic  form  using 
Rtf.)  as  the  matrix  and  a  complex  steering 
vector,  E  The  elements  of  the  steering  vec¬ 
tor  are  complex  representations  of  the  phase  de¬ 
lay  for  each  of  the  microphone  locations  for  a 
given  wavenumber.  Specifically,  the  frequency- 
wavenumber  power  estimate  is 


where  1  is  the  wavenumber  index  and  k  is  the 
frequency  index.  The  superscript  H  implies  con- 


jugate  transpose. 

Power  levels  are  thus  computed  at  all  ap¬ 
propriate  wavenumbers  for  each  frequency  of  In¬ 
terest.  a  total  of  approximately  42,000 
frequency-wavenumber  bins.  A  detailed  computa¬ 
tional  sizing  of  real-time  high  resolution 
analysis  indicates  that  approximately  11  million 
real  arithmetic  operations  (adds  or  multiplies) 
per  second  are  required. 

4.  Detection  and  Estimation. 

Detection  and  estimation  consist  of  examln- 
ing  the  power  vs.  frequency  and  wavemaber 
results.  A  peak  in  power  that  rises  several  dB 
above  background  noise  levels  for  a  particular 
wavenumber  at  several  frequencies  most  likely 
indicates  a  target.  A  time  average  of  the 
results  is  kept  such  that  the  current  power- 
frequency-wavenumber  results  may  be  compared  to 
those  of  the  recent  past.  If  a  peak  occurs  in 
several  sequential  results  and  appears  to  be 
moving  at  a  reasonable  speed,  its  deteatian  is 
further  confirmed  and  one  can  initiate  an  az¬ 
imuth  track.  However,  if  the  target  does  not  ap¬ 
pear  to  be  moving,  it  is  either  traveling  on  a 
radial  path  towards  the  sensor  or  it  is  a  sta¬ 
tionary  noise  source.  Data  from  other  sensors 
might  clarify  such  a  situation.  Effective,  au¬ 
tomatic  target  detection  and  azimuth  estimation 
is  a  difficult  problem  and  is  currently  being 
addressed. 


HI-  Caagraat.  Pulaad  Oapplor  Radar 

1.  Introduction. 

The  DSN  strawman  radar  has  been  described 
in  a  previous  paper,  Lacoss,  "Tutorial  on  Sensor 
Technology",  (these  proceedings).  The  radar 
sensor  is  considerably  more  sophisticated  than  a 
small  acoustic  array  and  more  target  metrics  can 
be  estimated  from  the  return  signals  in  the  ob¬ 
servation  space:  target  range,  azimuth,  and 
velocity.  The  signal  processing  required  to 
calculate  appropriate  statistics  is  substantial¬ 
ly  different  from  that  used  with  the  acoustic 
array;  however,  the  results  and 
detectlon/estlmatlon  rules  used  to  detect  and 
determine  target  metrics  are  very  similar. 
Since  the  current  study  has  not  Investigated  the 
radar  problem  in  depth,  less  detail  will  be 
presented  than  was  for  the  acoustic  sensor. 

Pulse  doppler  radar  observation  space  is 
divided  into  range-azimuth-velocity  (doppler) 
bins.  The  return  times  of  the  reflected  pulses 
and  location  in  the  scan  of  the  antenna  deter¬ 
mine  into  which  range  and  azimuth  bins  the  re¬ 
turn  data  are  placed.  The  pulses  returning  dur¬ 
ing  a  coherent  processing  Interval  or  that  time 
during  which  the  antenna  "dwells"  on  a  target 
are  processed  and  power  levels  for  the  different 
doppler  shift  frequency  or  velocity  bins  result. 


Similar  to  that  for  the  acoustic  sensor,  detec¬ 
tion  and  estimation  for  radar  consists  of  deter¬ 
mining  the  existence  and  location  (the  proper 
bins)  cf  peaks  in  power,  which  are  presented  as 
a  function  of  range,  azimuth,  and  veloaity. 

2.  Radar  Processing. 

Briefly,  the  radar  signal  processing  con¬ 
sists  of  the  fallowing: 

-data  checking  and  calibration  corrections 
-high-pass  filter  (3-pulse  oanceler) 

-windowing 

-fast  Fourier  transform  and  magnitude  computation 
-threshold  computation,  target  detection. 

3.  Detection/Estimation. 

The  major  outputs  resulting  from  the  signal 
processing  are  (1)  map  data  and  (2)  target  re¬ 
ports.  The  asp  data  represents  a  picture  of 
zero-velocity  ground  clutter.  It  is  extracted 
from  those  range-azimuth  bins  with  zero-doppler 
shift.  Target  reports  are  declared  when  the 
power  in  non-zero  velocity  bins  exceed  various 
thresholds. 

Several  interesting  detection  problems  oc¬ 
cur.  When  a  target  has  zero  radial  velocity 
with  reference  to  the  radar,  its  power  returns 
appear  in  tne  zero  doppler  bins.  However,  the 
target's  power  will  move  with  reference  to  the 
relatively  stable  ground  clutter.  Such  targets 
can  be  detected  in  the  clutter  by  noting  any  ra¬ 
pid  time  variations  in  the  clutter  map. 

Often,  clutter  such  as  clouds  and  other 
weather  formations  can  have  non-zero  velocities 
and  mask  targets.  However,  the  clouds  usually 
cover  a  much  larger  area  (many  more  range- 
azimuth  bins)  than  an  aircraft.  An  appropriate 
time-varying  threshold  can  be  calculated  by 
averaging  over  an  area  consisting  of  several 
range-azimuth  bins  (at  constant  doppler  shift). 
Thus  the  larger  area  with  relatively  high  power 
is  not  declared  a  target,  but  the  smaller  and 
sharper  peak  is  detected. 

The  azimuth  resolution  of  the  strawman  ra¬ 
dar  is  approximately  10  degrees—36  azimuths  per 
4  second  scan  or  9  coherent  processing  intervals 
per  second.  There  are  100  range  bins  covering 
0.125  to  12.5  kilometers  (range  resolution  of 
approximately  120  meters).  The  coherent  pro¬ 
cessing  Interval  contains  returns  from  512 
pulses.  Thus,  information  from  900  range- 
azimuth  bins  is  processed  every  second:  900 
512-point  complex  FFTs  are  computed  per  second. 
The  net  computational  requirements  for  FFT  pro¬ 
cessing  is  '2.5  Billion  real  arithmetic  opera¬ 
tions  (adds  or  multiplies)  per  second.  It  is 
estimated  that  the  other  computations  listed 
above  require  an  equivalent  amount  of  computa¬ 
tion.  Thus,  the  total  computational  require¬ 
ments  for  the  DSN  strawman  coherent  pulsed 
doppler  radar  are  approximately  25  million  real 
arithmetic  operations  per  second. 
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LOCUS  OF  POSSIBLE  PEAKS 
FOR  DISTANT  AIRCRAFT 
WHOSE  SOUND  WAVES  ARE 
PROPAGATING  PARALLEL 
TO  THE  GROUND. 


0:  AZIMUTH  OF  SOUND  SOURCE 
V:  VELOCITY  OF  SOUND  IN  AIR 
K  «  F/C 


Figure  5.  Wavenuaber  power  plot 
Interpretation. 


LOCATION  OF  POWER ^ 
PEAK  FOR  AIRCRAFT 
DIRECTLY  OVER  SENSOR.  • 
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ABSTRACT 

A  distributed  systaa  can  be  conacrucEed  so 
that  modifications  can  be  made  while  it  la  running. 
Dynamic  modifications  include  simple  changes  in  the 
code,  modifications  in  che  representation  of  data 
objects  and  res cruc curing  of  individual  data  ob¬ 
jects  or  encire  modules.  Applied  to  a  heteroge¬ 
neous  computer  network,  coexistence  of  local  Imple¬ 
mentations  allows  optimal  performance  in  each  node. 
Special  care  must  be  taken  chat  modification  of  one 
part  of  the  running  syscem  does  not  causa  an  ava¬ 
lanche  of  chsngea  in  programs  chat  use  it. 

1.  INTRODUCTION  v 

It  is  generally  recognized  Chat  che  cost  of 
software  maintenance  exceeds  by  far  che  cost  of 
initial  syscem  design.  Software  maintenance  is  no 
longer  considered  co  be  a  temporary  debugging 
phase,  but  a  permanent  effort  co  Improve  system 
performance,  co  adapt  a  system  to  new  and  improved 
hardware  and  co  provide  batter  user  facilities. 
Therefore,  much  is  gained  by  facilitating  system 
maintainability. 

Maintenance  of  a  distributed  syscem  that  rune 
on  a  computer  network  causes  soma  specific  prob¬ 
lems.  It  may  be  impossible  or  very  undesirable  to 
bring  che  running  syscem  to  a  halt  and  shut  It 
down  for  soma  time  In  order  co  replace  same  of  its 
parts.  Typical  examples  of  such  systems  are  air¬ 
line  ticket  reservation  systems,  banking  systems, 
etc.  This  raises  the  question  of  constructing  a 
distributed  syscem  which  is  modifiable  while  it  is 
running.  This  is  the  topic  of  this  working  paper. 

The  auesclon  is  answered  by  exploring  in  par¬ 
ticular 

a.  syscem  facilities  needed  for  runtime  mod¬ 
ifiability 

b.  which  kind  of  modifications  can  be  put 
through  while  the  syscem  Is  running 
("changes  in  programs,  changes  in  data 
representation) 

c.  the  impact  of  modifications  on  the  cur¬ 
rant  scats  of  cha  running  system 

d.  the  integration  of  modified  parts  with 
tha  running  system. 

2.  ADDRESS  SPACES 

An  important  development  In  programing  is 
che  3hlfc  in  emphasis  from  control  flow  co  data 
abstraction.  It  used  co  be  customary  to  partition 


a  task  along  the  time  dimension  of  its  subsaquanc 
accivltias.  Research  in  programing  methodology 
[1],  software  engineering  [2]  and  programming  lan¬ 
guage  daeign  [3 ]  demons  traced  cha  uaafulnaes  of 
organizing  programs  and  software  around  data  ob¬ 
ject  definitions  and  chair  manipulations. 

The  baaic  concept  for  designing  a  dynamically 
modifiable  system  is  that  of  "Address  Space".  An 
address  space  is  a  design  module  which  provides 
certain  faeilitlas  that  can  be  used  in  other  ad¬ 
dress  spaces  [4].  Typical  facilities  provided  by 
an  address  space  are: 

a.  craaclon  of  data  objects  which  are  moni¬ 
tored  by  that  address  space 

b.  operations  on  data  objects  monitored  by 
that  address  space. 

An  addrasa  space  la  determined  by  three  components, 
its  specification,  its  own  data  and  its  code.  Tha 
specification  part  is  discussed  further  on  in  this 
paper.  The  own  data  part  cons lata  of  che  data 
structures  which  are  needed  for  monitoring  tha  ob¬ 
jects  created  on  behalf  of  ocher  address  spaces. 
Typical  examples  are  a  ready  list  used  as  own  data 
In  a  process  scheduling  address  space  and  a  Symbol 
Table  used  by  a  lexical  scanner  (Implemented  as 
address  space) .  Die  coda  part  of  an  address  space 
consists  of  subroutine  and  function  program. 

Tha  parts  of  an  address  space  are  placed  In 
virtual  memory  segments.  A  segment  Is  accessible 
thrmigh  a  segment  descriptor  which  describes  loca¬ 
tion,  size  and  current  state  of  a  segment.  In  ad¬ 
dition,  each  segment  descriptor  has  a  link  field 
which  may  point  to  another  segment  descriptor  and 
a  reference  count  which  reflects  che  number  of 
descriptors  pointing  to  it.  Tha  descriptors  of  an 
address  space  are  linked  in  the  order  code  seg¬ 
ment,  own  segment,  spec  segment.  A  schematic 
representation  of  an  address  space  is  given  in 
Figure  l. 

Data  objects  are  also  placed  in  virtual  mem¬ 
ory  segments.  A  data  segment  is  attached  to  the 
address  space  that  monitors  ehe  oojects  in  that 
segment  by  linking  its  descriptor  to  the  coce 
segment  descriptor  of  thac  address  space.  The 
reference  count  of  a  code  segment  is  equal  to  the 
number  of  data  segments  chat  are  actached  to  an 
address  space. 

3.  DYNAMIC  MODIFICATIONS 

Modifications  of  an  address  space  affect  the 
various  types  of  segments  in  various  ways.  We 
classify  modifications  into  four  categories  '5]: 
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code  segment  own  segment  spec  segment 

descriptor  j  descriptor  descriptor 


Figure  1.  Schematic  Representation  of  an  Address  Space 


1.  modifications  which  change  the  code,  but 
leave  all  data  representations  as  they 
are  (ccssod) 

2.  modifications  which  change  both  code  and 
data  representation  (repood) 

3.  remodeling  of  existing  daca  objects  into 
a  new  representation  (datamod) 

i.  modifications  which  change  the  represent* 
tion  of  own  daca  (owneod) . 

3.1  Code  Modifications 


c :  o :  s  : 


c'  : 


Figure  2.  Data  Representation  Modification  "repmod” 


Let  address  space  AS  be  determined  by  a  code 
- ,  an  own  -  and  a  spec  segment  whose  respective 
descriptors  are  c,  o  and  s.  Let  c'  be  the  descrip- 
cor  pointing  to  a  new  version  of  the  code  for  AS. 
We  now  wish  to  replace  old  version  c  by  new  ver¬ 
sion  c'  (assuming  chat  the  representation  of  data 
remains  unaltered).  This  is  accomplished  in  a 
very  simple  way.  All  we  need  to  do  is  to  exchange 
the  chree  fields  (loc,  size,  state)  of  the  descrip 
tors  c  and  c'.  The  effect  of  chis  exchange  opera¬ 
tion  is  chat  Che  old  descriptor  c  now  points  to 
che  new  version  of  Che  code  and  the  new  descriptor 
c'  to  che  old  code  version.  The  advantage  of 
changing  to  the  new  code  version  In  this  way  is 
Chat  che  reference  count  is  preserved  and  that  the 
linking  of  data  segments,  code  segment,  own  seg¬ 
ment  and  spec  segBMnt  is  not  affected  at  alii 

3.2  Data  Representation  Modifications 

We  Introduce  che  notion  of  che  "current  ver¬ 
sion”  of  an  address  space.  The  current  version  is 
che  most  recently  Installed  (code,  own)  segmenc 
pair.  We  use  che  link  field  in  Che  3pec  segment 
descriptor  Co  point  to  the  code  segment  of  the  cup 
rent  version  (see  Figure  2) . 

In  case  of  "repood",  a  new  version  is  intro¬ 
duced  vlthouc  deleting  older  versions.  The  new 
code  segmenc  descriptor  c'  is  linked  to  the  exist¬ 
ing  own  segmenc  descriptor  o,  o's  ref  count  is 
incremented  by  one  and  che  version  pointer  in  spec 
segmenc  descriptor  s  is  sec  to  point  to  c'  (see 
Figure  2) . 


The  reason  for  keeping  che  old  version  around 
is  that  "old"  data  objects  (objects  which  were 
created  before  the  representation  was  modified) 
oust  be  handled  by  che  old  code  version,  because 
they  still  have  the  old  representation.  New  data 
objects  which  are  created  after  the  representation 
modification  are  automatically  shaped  after  che 
latest  fashion  and  attached  co  che  current  version 
c'  which  is  found  through  che  version  pointer  in 
s . 


If  che  representation  of  an  address  space  is 
modified  several  times,  there  may  be  a  number  of 
different  versions  (their  number  is  counted  in 
ref  count  (o)),  each  having  a  number  of  daca  ob¬ 
jects  attached.  All  these  objects  are  still 
usable  and  correctly  operated  upon  through  che 
pointers  of  daca  segment  descriptors  co  code  ver¬ 
sions  . 


3.3  Remodeling  of  Data  Jb-ects 

Daca  representation  modification  as  discussed 
in  the  preceding  section  does  not  automatically 
include  reshaping  of  old  data  objects.  We  saw 
chat  this  is  not  necessary,  because  old  objects 
can  still  be  operated  upon  by  the  old  version  of 
che  code.  It  is  also  difficult  to  reshape  all 
daca  objects  automatically,  because  there  is  no 
facility  provided  to  find  all  objects  attached  to 
an  address  space.  Their  number  is  known,  but  no 
backwards  referencing  is  provided. 
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However,  when  an  old  dace  object  Is  accessed, 
its  user  should  have  the  option  of  reshaping  it 
according  to  the  most  recant  daca  representation 
modification.  This  can  be  done  if  it  is  known  how 
the  current  state  of  an  old  object  is  mapped  into 
an  equivalent  scate  of  a  new  object.  This  mapping 
must  be  provided  by  the  designer  of  a  new  data 
representation.  The  mapping  can  be  defined  as  a 
pair  of  prograsu  "decode", "encode".  (This  solution 
differs  significantly  from  the  one  presented  in 
[*].)  Assuming  that  a  simple  canonical  represent* 
tion  of  data  objects  is  given,  procedure  "decode" 
maps  the  current  state  of  an  old  object  into  the 
canonical  representation.  Procedure  "encode"  maps 
this  canonical  version  into  an  initialized  object 
in  the  current  representation  so  thac  the  initial* 
lzed  new  object  is  equivalent  to  the  state  of  the 
old  object. 

When  the  old  object  is  deleted,  the  reference 
counc  of  che  code  descriptor  it  points  to  is  deer* 
menced.  If  chis  reference  counc  goes  to  zero,  no 
object  is  attached  co  this  version  any  longer,  so 
it  can  be  deleted  (while  the  reference  counc  in 
che  own  descriptor  is  decremented) .  The  new  ob¬ 
ject  created  by  encode  is  attached  co  che  current 
code  segment  descriptor  (whose  reference  counc  is 
incremented  by  one) . 

After  several  representation  modifications 
there  may  be  several  versions  in  use  which  each 
need  their  specific  decoding  procedure.  Thus,  a 
natural  place  to  put  a  decoding  procedure  is  in 
the  code  segment  of  its  version.  There  are  two 
reasons  which  suggest  that  Che  natural  place  for 
che  encoding  procedure  is  in  a  unique  spec  seg¬ 
ment  pointed  co  by  che  spec  segment  descriptor. 
First,  there  is  only  one  current  version,  so  a 
single  encoding  procedure  suffices.  Second,  in 
che  same  way  chat  a  decoding  procedure  defines  a 
path  from  an  (old)  implementation  co  che  specs, 
the  encoding  procedure  defines  a  path  from  the 
specs  co  che  current  implementation. 


On  the  ocher  hand,  each  Clma  che  data  repre¬ 
sentation  is  modified,  che  designer  of  che  modifi¬ 
cation  must  provide  a  decode  procedure  (in  order 
co  enable  remodeling  in  che  future)  and  an  encode 
procedure  for  reshaping  existing  data  objects. 
Although  these  procedures  are  not  used  at  che  same 
time,  che  fact  thac  they  are  designed  and  written 
together  with  the  new  code  version  is  an  argument 
in  favor  of  including  both  procedures  la  the  new 
code  segment.  This  has  che  additional  advantage 
thac  there  is  no  need  for  "rapmod"  co  set  the  spec 
segment  descriptor  to  che  new  encoding  procedure. 
3ut  it  has  che  disadvantage  thac  old  encode  pro¬ 
cedures  which  will  never  be  used  again  remain  in 
existence  and  occupy  memory  space. 


Reshaping 


The  most  radical  change  of  an  address  space 
is  caused  by  modifying  the  representation  of  own 
daca.  This  affects  coda  and  own  segment  and  pos¬ 
sibly  che  representation  of  data  objects  as  well. 
Moreover,  since  own  data  is  unique,  the  old  own 
segment  must  be  slapped  into  the  new  own  segment 


immediately  when  the  new  representation  is  Intro¬ 
duced  and  the  old  code  and  own  segments  must  be 
invalidated.  This  Implies  thac  all  old  daca  seg¬ 
ments  are  also  useless,  because  they  cannot  use 
che  old  address  space  version. 

The  designer  of  a  new  own  daca  representation 
must  provide  a  procedure  for  mapping  the  current 
own  segment  with  all  its  information  into  an 
equivalent  new  own  segment.  This  procedure  is 
executed  exactly  once  when  "ownmod"  is  performed. 

In  addition,  he  oust  provide  a  new  code  version, 
including  encode  and  decode  procedures.  The  ef¬ 
fect  of  "ownmod"  on  the  linking  information  is 
thac  a  new  cycle  (c'  -  o'  -  s  -  c')  is  formed 
where  C  and  o'  are  the  new  code  and  own  segment 
descriptors.  The  old  links  are  not  deleted  except 
for  the  current  version  pointer.  Although  all  old 
daca  objects  are  useless,  "ownmod”  is  not  able  co 
find  these  objects  and  remodel  chem.  This  does 
not  matter,  because  we  can  wait  for  each  data  ob¬ 
ject  uncil  it  is  accessed.  By  that  time  it  is 
found  thac  the  code  segment  descriptor  it  is  at¬ 
tached  to  has  been  invalidated.  The  daca  object 
is  then  reshaped  by  applying  the  decode  procedure 
of  che  old  version  and  the  current  encode  proce¬ 
dure.  The  newly  created  object  is  attached  co  the 
current  code  descriptor  and  che  old  object  is 
deleted. 

It  turns  out  thac  reshaping  an  old  object  is 
accomplished  by  "dacamod"  in  all  cases.  However, 
after  a  "rapmod"  applying  "dacamod"  co  an  old  daca 
object  is  optional,  whereas  after  "ownmod"  applica¬ 
tion  of  "datamod"  is  mandatory. 

i.  REPRESENTATIONS  IN  COMPUTER  NETWORKS 

In  case  the  nodes  of  a  computer  network  are 
not  identical,  a  variation  on  che  ideas  described 
in  preceding  sections  is  useful  for  "node  specific1 
implementations  of  address  spaces.  Let  us  assume 
that  most  operations  on  daca  objects  are  performed 
locally  in  the  network  node  in  which  che  object 
resides.  If  Che  nodes  are  different  machines,  it 
is  likely  thac  one  can  find  different  ooclmal 
implementation  for  each  node.  The  following  orga¬ 
nization  makes  it  possible  to  use  such  optimal 
implementations. 

An  address  space  has  a  unique  own  segment 
which  resides  in  some  node.  Operations  on  own 
daca  take  place  across  the  network  (and  should 
therefore  be  kept  co  a  minimum) .  We  abandon  che 
idea  of  a  unique  current  version  of  che  code. 
Instead,  each  node  of  che  network  will  have  its 
own  version  (possibly  more  than  one)  and  its  own 
version  of  representing  daca  objects.  All  code 
versions  point  to  the  unique  own  segment  descriptor 
and  all  objects  in  a  node  are  attached  to  its  local 
code  version.  Efficiency  is  achieved  if  most  op¬ 
erations  on  daca  objects  are  performed  locally. 

In  order  to  allow  migration  of  daca  objects 
from  one  node  Co  another,  each  version  of  a  code 
segment  should  concain  a  decode  and  encode  pro¬ 
cedure.  When  a  daca  object  is  transferred  from 
node  A  to  node  3,  we  apply  the  decode  procedure  of 
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A  Co  it  and  Che  encode  procedure  of  E.  This  guar¬ 
antees  full  flexibility  among  all  nodes  chac  hose 
an  implementation  of  a  particular  address  space. 

It  is  obvious  chac  this  scheme  pays  off  only 
if  the  frequency  of  migrating  data  objects  is  much 
smaller  chan  chac  of  local  operations.  The  beauty 
of  this  organization  is  chat  the  local  implementa¬ 
tions  are  entirely  independent  of  one  another. 
Implementation  details  of  one  node  are  entirely 
hidden  from  other  nodes.  Comounicacion  cakes 
place  through  a  simple  general  specification. 

5.  INTEGRATION  OF  NEW  VERSIONS 
INTO  THE  RUNNING  SYSTEM 

If  a  code  segment  is  replaced  by  a  new  ver¬ 
sion,  we  must  assume  chac  the  starting  addresses 
of  corresponding  subroutines  and  functions  in  Che 
two  versions  are  aoc  the  same.  The  quescion  is 
how  programs  of  ocher  address  spaces  refer  to 
these  procedures. 

The  thing  we  wish  to  avoid  is  that  existing 
code  oust  be  changed  because  code  it  has  been 
linked  to  has  been  replaced.  This  problem  is 
solved  by  using  "entry  vectors"  in  code  segments. 

An  entry  vector  is  an  array  of  procedure  starting 
addresses.  Each  procedure  is  associated  with  a 
particular  element  of  the  entry  vector.  Programs 
in  other  code  segments  call  a  procedure  by  its 
entry  vector  element  which  is  the  same  for  differ¬ 
ent  versions  of  the  code. 

The  cost  of  this  solution  is  an  extra  level 
of  indirection  in  calling  a  procedure.  Indirec¬ 
tion  is  inexpensive  on  most  machines  and  negligible 
compared  to  the  normal  overhead  of  a  procedure 
call.  The  great  advantage  is  in  the  fact  that 
modifications  affect  the  new  version  only,  while 
all  existing  programs  using  it  do  not  need  to  be 
changed. 

It  is  not  necessary  for  a  user  to  Include 
entry  vectors  in  his  program,  or  for  a  compiler  to 
generate  them.  It  is  a  simple  matter  for  a  link 
editor  to  derive  a  new  entry  vector  from  the  new 
code  version  and  the  old  entry  vector.  If  it  is 
allowed  that  procedures  are  given  in  a  different 
lexical  order  in  a  new  version,  it  is  necessary  to 
remember  che  mapping  of  procedure  names  onto  entry 
vector  elements.  This  information  is  useful  any¬ 
way,  because  it  facilitates  resolving  external 
procedure  references  in  new  programs. 

CONCLUSION 

A  system  can  be  constructed  so  chat  it  is 
modifiable  while  it  is  running.  The  kind  of  modi¬ 
fications  which  can  be  made  in  a  running  system 
vary  from  simple  changes  in  programs  to  modifica¬ 
tions  of  entire  address  spaces.  It  is  possible  to 
work  wich  data  objects  of  different  representations 
in  a  single  address  space.  Old  representations 
lose  their  validity  only  if  an  entire  address  space 
is  modified. 

The  use  of  different  data  representations  and 


code  implementations  may  be  very  useful  for  a  het¬ 
erogeneous  computer  neework.  Local  operations  can 
be  very  efficient  and  migration  of  objects  is  made 
possible  by  local  decode  and  encode  procedures. 

Modifications  in  one  address  space  should  not 
necessitate  changes  in  other  address  spaces  which 
use  its  facilities.  The  use  of  entry  vectors 
assures  that  references  to  external  procedures 
remain  correct  when  the  defining  code  segment  is 
modified. 
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1.  Introduction 

The  PSI  knowledge  based  programing  system  is  a 
computer  program  that  acquires  high  level  descrip¬ 
tions  of  programs  and  produces  efficient  Imple¬ 
mentations  of  these  programs  [Green-76].  Simple 
symbolic  computation  programs  are  specified 
through  dialogues  that  Include  natural  language, 
input-output  pair3,  and  partial  traces.  The  pro¬ 
grams  produced  are  in  LISP,  but  experiments  have 
shown  that  the  system  can  be  extended  to  produce 
code  in  a  block  structured  language  such  as  PASCAL. 

This  paper  provides  a  brief  description  or  the  PSI 
system  and  discusses  its  applicability  to  the  cask 
of  synthesizing  signal  understanding  systems  in 
time-critical  environments. 


The  PSI  Program  Synthesis  System 


The  PSI  system  works  as  follows.  The  user  wants  a 
performance  program  of  some  type,  for  example,  a 
news  story  indexing  program.  The  user  must  be 
versed  in  the  application  area,  though  need  not  be 
an  expert  at  programming  or  understand  how  the 
PST  system  writes  programs.  The  user  conducts  a 
dialogue  with  PSI,  using  natural  language  as  well 
as  traces,  examples,  and  very  high  level  languages 
co  describe  the  desired  program.  PSI  synthesizes 
the  program.  Since  program  specification  and  use 
is  an  evolutionary  process,  each  successive  ver¬ 
sion  of  the  candidate  target  program  is  coded  or 
interpreted  by  PSI  and  tested  by  the  user.  Modi¬ 
fications  in  specification  result  in  successive 
versions  of  the  target  program  as  new  requirements 
develop  or  previous  requirements  are  clarified. 


PSI  is  organized  as  a  collection  of  interacting 
modules  or  programmed  experts  as  displayed  in 
Figure  1.  The  major  data  paths  and  modules  of  the 
PSI  system  are  3hown  in  Figure  2.  There  is  one 
data  path  for  each  specification  method.  Cur¬ 
rently  these  are  English,  input-output  examples, 
and  partial  traces.  A  more  conventional  method, 
that  of  a  very  nigh  level  language.  Is  a  planned 
addition  to  PSI  as  shown  in  Figure  2.  These 
specifications  are  integrated  In  the  program  net 
and  model. 


PSX's  operation  may  be  conveniently  factored  into 
two  parts  (see  Figure  1) :  the  acquisition  phase 
(chose  modules  shown  left  of  Che  program  model), 
which  acquires  Che  model,  and  the  synthesis  phase 
wnlch  produces  a  program  from  Che  model. 


In  the  acquisition  phase,  sentences  are  first 
parsed,  then  interpreted  and  scored  in  the  program 
net  (also  referred  to  as  the  "program  specifica¬ 
tion"  in  [Glnsparg-78] ) .  The  parser  is  a  general 
parser  which  limits  search  by  incorporating  consi¬ 
derable  knowledge  of  English  usage.  The  inter¬ 


preter  i3  more  specific  to  program  synchesis, 
using  program  description  knowledge  as  well  as 
knowledge  about  Che  question  asked  and  the  cur¬ 
rent  copic  co  facilitate  interpretation  into  the 
program  net. 

The  dialogue  moderator  guides  the  dialogue  by 
selecting  or  suppressing  questions  for  the  user. 

It  attempts  co  keep  PSI  and  che  user  in  agreement 
on  che  current  topic,  provides  a  review  and  pre¬ 
view  of  topics  when  che  copic  changes,  helps  the 
user  who  gees  lost,  and  allows  initiative  to 
shift  between  PSI  and  the  user. 

A  new  module  is  che  explainer,  which  generates 
in  English  reasonably  clear  questions  about  and 
descriptions  of  program  models  as  they  are  ac¬ 
quired,  in  order  co  help  verify  that  the  inferred 
program  description  is  che  one  desired.  It  will 
also  be  able  to  explain  the  how  and  why  of  che 
acquisition  and  synthesis  process  to  the 
interested  user. 

Anocher  input  specification  method  is  a  partial 
crace  [Phillips-77],  A  trace  includes  as  a  spe¬ 
cial  case  an  example  Input-output  pair.  Examples 
are  useful  for  inferring  data  structures  and  sim¬ 
ple  spatial  transformations.  Partial  traces  of 
states  of  internal  and  I/O  variables  allow  the 
inductive  Inference  of  control  structures.  The 
crace  and  example  Inference  expert  Infers  a  loose 
description  of  a  program  in  che  form  of  a  program 
net,  rather  chan  a  program  model  or  ocher  true 
algorithm.  This  technique  allows  domain  support 
to  disambiguate  possible  Inferences  and  also 
separates  the  issue  of  efficient  Implementation 
from  the  inferenece  of  the  user.'s  intention. 

Various  types  of  programming  knowledge  are 
distributed  throughout  the  modules  of  che 
acquisition  phase.  In  contrast,  knowledge  speci¬ 
fic  to  one  particular  application  domain  ie.g., 
knowledge  abouc  learning  programs)  is  concencraced 
in  the  domain  expert,  which  supplies  domain 
support  by  communicating  with  ocher  acquisition 
modules  through  che  program  nec. 

The  program  net  and  the  program  model  (see  Fig¬ 
ure  2)  are  cwo  of  the  major  interfaces  within 
PSI.  Both  are  high  level  program  and  data 
structure  description  languages.  The  program 
model  Includes  complete,  consistent,  and  inter¬ 
precable  very  high  level  algorithm  and  information 
structures.  The  program  net,  on  che  ocher  hand, 
forms  a  looser  program  description.  Fragments  of 
the  program  net  can  be  visicea  in  che  order  of 
occurrence  in  the  dialogue,  rather  chan  in  execu¬ 
tion  order,  and  allow  less  decailed,  local,  and 
only  partial  specification  of  the  program.  Since 
these  fragments  corresoond  rather  closely  to  wnac 
the  user  says,  they  ease  the  burden  of  cne  parser/ 
interpreter  as  well  as  che  crace  and  exaooie 
inference  module. 

The  program  model  builder  [McCune-77]  applies 
knowledge  of  correct  program  models  to  convert 
che  fragments  into  a  model.  The  model  builder 
processes  fragments,  checking  for  ccnrleteness 
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and  correctness,  tills  in  decail,  corrects  minor 
inconsistencies,  and  adds  cross-references.  It 
also  generalizes  the  program  description,  con¬ 
verting  it  into  a  fora  chat  allows  che  coder  to 
look  for  good  implementations.  The  completed  pro¬ 
gram  model  may  be  interpreted  by  the  model  inter¬ 
preter  co  check  that  it  performs  as  desired  by  the 
user  and  also  to  gather  information  needed  by  the 
efficiency  expert,  such  as  statistics  on  set  sizes 
and  probabilities  of  che  outcome  of  cases. 


After  che  acquisition  phase  is  cooplece,  che 
synthesis  phase  begins.  This  phase  may  be  viewed 
as  a  series  of  refinements  of  Che  program  model 
loco  an  efficient  program,  or  as  a  heuristic 
search  in  a  refinement  tree  for  an  efficient  pro¬ 
gram  that  satisfies  the  program  model.  The  coder 
[Barscow-77]  has  a  body  of  program  synthesis 
rules  [Green  4  3arscow-75,  Green  4  3arstow-77j 
which  are  applied  co  gradually  transform  the  pro¬ 
gram  model  from  abstract  into  more  detailed  con¬ 
structs  until  it  is  in  che  targec  language.  The 
algorithm  and  daca  structures  are  refined  iacer- 
dependently.  The  coder  deals  primarily  with  the 
nocions  of  sec  and  correspondence  operations  and 
can  synthesize  programs  involving  sequences, 
loops,  simple  input  and  output,  linked  lists, 
arrays,  and  hash  cables. 

The  refinement  tree  effectively  forms  a  planning 
space  that  proposes  only  legal,  but  possibly 
inefficient,  programs.  This  tree  structure  la 
shared  by  che  coder  and  the  efficiency  expert 
[Kant-77],  When  the  coder  proposes  more  chan  one 
refinement  or  implementation,  the  efficiency  ex¬ 
pert  reduces  che  search  by  estimating  the  time- 
space  cost  product  of  each  proposed  refinement. 

The  better  path  is  followed,  and  there  la  no  back¬ 
up  unless  the  estimate  later  proves  co  be  very 
bad.  An  additional  method  co  reduce  the  size  of 
che  search  space  is  che  factorization  of  the  pro¬ 
gram  Into  relatively  independent  parts  so  chat  all 
combinations  of  lmplmencaclons  are  not  conaldered. 
An  analysis  for  bottlenecks  allows  che  synthesis 
effort  to  concentrate  on  che  more  critical  parts 
of  che  program. 

The  encire  PSI  system  can  now  be  used  by  a  know¬ 
ledgeable  user.  Programs  have  been  generated  from 
English  dialogues  for  a  variety  of  domains.  Among 
these  are: 

•CLASS:  A  simple  pactern  classification  program 
which  requires  much  of  the  programming  know¬ 
ledge  neceeaary  for  more  complex  programs. 

•TF:  Theory  Formation,  whose  goal  is  co  "learn'' 
(form)  an  internal  model  of  a  concept  by 
repeatedly  examining  examples  of  the  concept. 

•NEWS:  An  information  retrieval  program. 

•Sorting  algorithms:  Efficient  sorting  algo¬ 
rithms  for  specific  sorting  requirements. 


allcacion  to  Signal  Understanding  Systems 


One  new  application  of  knowledge  based  programming 
is  in  che  area  of  processing  acoustic  data  from  a 
distributed  sensor  network.  '7e  are  investigating 
the  automatic  -generation  of  the  narmonic- 


set  formation  portion  of  the  SIAP  system 
[Drazovich  4  Brooks-78]. 

In  addicion,  we  are  working  on  basic  issues  so 
that  similarly  developed  systems  can  be  applied  in 
other  important  military  mission  areas  such  as 
electronic  warfare  surveillance.  In  che  undersea 
surveillance  application  area  the  signal  under¬ 
standing  program  13  designed  to  produce  a  descrip¬ 
tion  of  che  ocean  scenario,  changing  with  time, 
that  indicates  the  platforms  in  the  ocean  that  are 
generating  the  signals  being  perceived  by  the  sen¬ 
sors  of  che  undersea  surveillance  system.  The 
ships  and  submarines  being  detected  and  tracked 
are  in  a  noisy  ocaan  environment  that  may  also 
contain  ocher  ships  of  no  interest  to  the  surveil¬ 
lance  system. 

Suppose  an  analyst  is  attempting  co  analyze  sig¬ 
nals  being  received  by  hydrophones  directed  coward 
a  group  of  submarines  and  other  placforms  in  a 
noisy  environment.  The  analyst  must  find  the 
location  and  type  of  each  ship  and  associate  each 
frequency  found  with  a  likely  source.  This  task 
is  known  to  be  quite  difficult.  It  far  exceeds 
the  capabilities  of  any  straightforward  parametric 
classification  or  pactern  recognition  system.  It 
is  at  the  limit  of  what  is  achievable  by  knowledge 
baaed  signal  understanding  systems  consisting  of 
large  rule  bases  and  programs  that  model  the 
sources  (blades,  shafts,  pumps,  etc.),  harmonic 
and  ratio  relations  of  sources,  types  of  sources 
on  platforms,  operational  patterns,  the  ocean 
environment,  che  noise  sources,  maximum  speed  of 
che  platforms,  whether  the  locations  are  shipping 
lanes,  and  so  forth.  If  the  placforms  decide  co 
change  their  sound,  they  could  disguise  themselves 
effectively  by  changing  their  source  characteris¬ 
tics  (e.g.,  by  using  tone  altering  synthesizers, 
running  close  together,  using  alternate  pumps  and 
acoustic  masking  devices,  running  near  sound-re¬ 
flecting  structures,  and  altering  their  operating 
patterns) .  The  situation  can  be  further  compli¬ 
cated  by  che  Introduction  of  new  types  of  micro¬ 
phones  or  signal  processing  systems.  With  these 
kinds  of  changes  happening  the  problem  Is  challen¬ 
ging  indeed.  The  problem  is  that  che  signal  un¬ 
derstanding  program  would  have  to  be  preprogrammed 
to  anticipate  each  possible  change  in  daca  races, 
harmonic  structures,  amplitude  and  frequency  modu¬ 
lation,  etc.  Similarly,  any  approach  using 
learning  would  also  have  co  anticipate  all  of  che 
types  of  changss  that  could  be  expected  and  co  be 
able  to  search  che  very  large  space  of  possible 
changes  to  find  nsw  patterns.  It  is  quite  unlike¬ 
ly  that  it  will  be  possible  co  anticipate  all  such 
changes. 

A  more  reasonable  approach  might  be  to  allow  the 
signal  understanding  system  to  be  reprogrammed  to 
respond  to  new  patterns.  The  difficulty  now  is 
the  time  to  reprogram  and  debug  a  complex  system 
in  the  short  time  allowed  in  a  tactical  situation. 
The  necessary  reprograming  and  debugging  is  of 
course  a  slow  process.  A  soiucion  co  the  repro¬ 
gramming  problem  is  to  use  an  automatic  program 
synthesis  system  co  reprogram  or  modifv  existing 
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programs  and  daca  structures  to  taeec  the  new 
requirements. 

A  scenario  £ or  the  response  to  new  signal  char¬ 
acteristics  might  begin  with  the  signal  under¬ 
standing  system  railing  to  respond,  providing  in¬ 
formation  inconsistent  with  ocher  observations, 
or  reducing  certainty  factors  for  identified 
sources.  We  assume  that  Che  appropriate  portion 
of  the  existing  signal  understanding  system  was 
Itself  automatically  synthesized,  and  that  the 
associated  explainer  module  could  describe  in 
English  the  performance  of  Che  system  and  help 
pinpoint  the  type  of  signal  changes  that  are 
causing  the  problem.  The  user  requests,  in  Eng¬ 
lish,  probes  into  the  data  to  look  for  any  pat¬ 
terns  that  characterize  sources  chac  seem  co 
cause  crouble.  A  user  may  know  from  another 
observation  the  Identity  and  locaclon  of  a  parti¬ 
cular  source,  tf  so,  the  user  could  request  a 
learning  program  to  find  paccarns,  expressed  as 
rules,  thac  characterize  thac  source.  The  signal 
understanding  system  would  then  be  automatically 
reprogrammed  co  use  the  new  rules  and  reanalyze 
the  signals.  This  interactive  process  is  repeated 
until  a  satisfactory  understanding  of  the  signals 
is  achieved. 

A  more  difficult  situation  occurs  when  the  cruch 
of  a  situation  cannot  be  established  by  means  of 
any  controlled  experiments,  which  is  frequently 
the  case.  For  example,  in  a  noisy  ocean  environ¬ 
ment  one  can  never  positively  identify  all  Che 
placforms  to  determine  what  is  really  producing 
the  signals.  Since  one  cannot  ascertain  truth, 
one  can  only  judge  chac  a  given  signal  analysis 
is  satisfactory  according  co  some  sec  of  criteria. 
Then  Che  program  must  still  find  new  rules  chac 
produce  some  satisfactory  model  of  the  situation, 
according  to  criteria  that  an  adequate  model  of 
che  situation  would  satisfy. 

The  difficulty  is  compounded,  in  that  it  will 
not  in  general  be  possible  co  anticipate  what 
criteria  or  meta-rules  a  satisfactory  model  must 
satisfy.  For  example,  che  analyst  might  suddenly 
notice  chac  the  number  of  submarines  has  drasti¬ 
cally  Increased.  One  might  add  a  constraint  noc 
anticipated  by  the  system  designer  chat  it  isn't 
possible  for  submarines  to  replicate  themselves. 
The  cause  for  che  Increase  in  sound  sources  might 
be  thac  some  sound  generator  waa  altered  and  the 
harmonics  produced  were  taken  as  separate  sources . 
It  would  then  be  appropriate  co  relax  the  con¬ 
straints  on  grouping  of  harmonics  so  that  pre¬ 
viously  disallowed  harmonic  structures  would  be 
acceptable  if  chey  arise  from  the  same  location. 
Another  situation  might  be  chat  the  sounds  weren't 
recognized  because  che  submarines  began  moving  at 
speeds  chat  were  noc  anticipated.  One  might  have 
che  system  generate  its  best  hypothesis  chat 
assumes  that  anything  moving  very  quickly  or 
erratically  is  not  really  a  submarine  but  instead 
a  decoy.  One  could  also  add  constraints  for  a 
hypothesis  that  best  explains  all  possible  sound 
sources  or  is  least  likely  to  miss  especially 
interesting  ones. 


The  first  major  cask  for  our  new  system  (called 
CHI)  in  undersea  surveillance  is  reprogramming 
the  signal  classif icacion  module  illustrated  in 
Figure  3.  The  system  we  develop  will  input  che 
old  signal  classification  module,  plus  new  signal 
classification  rules  in  a  language  natural  for 
expressing  cheat.  CHI  will  produce  as  output  a 
modification  of  the  original  signal  classification 
program  which  appropriately  makes  use  of  these  new 
rules.  The  classification  program  is,  in  this 
case,  primarily  a  harmonic-sec  formation  program 
thac  partitions  the  sac  of  frequency  signals  into 
a  harmonically  related  group  produced  by  one 
source  of  one  platform.  The  claasif Icacion  pro¬ 
gram  will  use  as  primitive  operations  (1)  existing 
primitives  of  the  targec  programming  language,  (2) 
signal  retrieval  commands  to  a  daca  management 
system,  and  (3)  subroutines  in  a  simple  statistics 
library. 

The  second  major  task  being  considered  for  CHI 
is  co  writs  a  module  that  learns  or  hypothesizes 
new  pattern  classification  rules  on  its  own.  The 
input  for  this  task  is  a  list  of  constraints  chat 
all  rules  must  satisfy.  The  output  is  a  program 
chat  modifies  old  rule  secs  based  upon  new  signal 
information  about  known  situations. 

The  feasibility  of  many  other  applications  may  not 
be  far  off.  The  promise  lies  in  our  approach; 
namely,  thac  of  building  a  large  knowledge  based 
system  that  emphasizes  the  codification  of  under¬ 
lying  programming  principles  combined  with  appli¬ 
cation  specific  expertise.  Some  generality  has 
already  been  demonstrated  by  extending  PSI  to  deal 
with  ostensibly  different  kinds  of  programs,  using 
essentially  the  saoie  knowledge  base. 
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A.  INTRODUCTION 

"Morse  coda  (at  sent  by  hand)  ia  ana  of  tha 
simplest  of  aural  languages,  yat  it  haa  faaturaa  common 
to  all  tpokon  languages."  That  waa  writtan  nearly  20 
yoara  ago  by  M.  Fraimar  at  al  [1].  Until  racantty  machine 
recognition  of  manuM  Morao  code  (ita  traneiation  to 
printed  text)  aiudad  aecaptabio  solutions,  albeit  not  from 
lack  of  attention.  A  project  in  tha  mid-fifties  at 
Massachusetts  Inatituto  of  Technology’a  Lincoln 
Laboratorioa  roauitod  in  MAUOE  (for  Morao  Automatic 
Oacodor),  one  of-  tha  eartieat  machine  tranalatora  of 
manual  Morao  code  [2£,4,S].  MAUDE  and  other  early 
machine  tranalatora  [6.7A9]  ware  baaed  an  a  amoK  aet 
of  simple  rules,  both  statiatical  and  linguistic.  The  rxiea 
did  not  in  any  Sanaa  comprise  a  modal  of  tha  language  — 
for  instance  English  —  to  bo  translated.  Those  early 
attempts  wore  restricted  solely  to  translation.  No 
attempt  waa  made  to  incorporate  into  the  systems  an 
understanding  of  tha  information  being  sent,  nor  were 
aspects  of  the  radio  domain  taken  into  account  A  main 
promise  of  MAUOE  and  other  early  Morse  cade  systems 
—  explicitly  stated  or  implicitly  assumed  —  was  that 
Morse  code  could  bo  decomposed  into  a  sequence  of 
symbols  comprised  of  the  alphabet,  the  numerals  0 
through  9  and  punctuation  marks.  Sinca  such  a  aet  is 
complete  any  sequence  of  Morao  coda  could  In  principle 
be  expressed  by  a  sequence  of  these  symbol* 

Recently,  a  project  at  Maaaachuaetta  Inatituto  of 
Technology’s  Laboratory  for  Computer  Science  has 
developed  a  new  perspective  on  tha  manual  Morse  code 
problem.  Wo  have  taken  quits  seriously  the  character  of 
manual  Morse  coda  as  expressed  by  tha  first  sentence  of 
this  paper.  Indeed  our  approach  has  boon  to  include  in  a 
manual  Morse  code  system  extensive  models  or 
knowledge  of  tha  Morse,  radio  and  natural  language 
domains.  Further,  understanding  of  parts  of  a  Morse 
code  conversation  is  attempted,  and  it  appears  that  it  is 
necessary  to  understand  soma  parts  of  what  is 
transmitted  in  order  to  translate  it  correctly. 


a  THE  MANUAL  MORSE  CODE  PROBLEM 

Tha  problem  domain  is  that  of  hand-sent  Morse 
code  in  an  amateur  radio  network  environment  To 
provide  a  global  perspective  of  the  Morse  problem,  a 
scenario  of  a  typical  amateur  Morse  cede  network  may 
prove  haipfuL  A  Morse  code  network  can  be  thought  of 
aa  being  composed  of  a  hierarchical  collection  of  sub¬ 
networks.  A  message  in  one  sub-netwerfc  may  be 
destined  ultimately  for  another  sub-network  in  the 
hierarchy.  A  sub-network  is  typically  composed  of  a 
group  of  operators  and  a  controller  (who  is  alee  an 
operator).  Tha  network  controller's  function  is  that  of  a 
traffic  manager  and  general  overseer  to  insure  proper 
massage  flow  within  his  sub-network.  The  network  (we 
will  honcoforth  drop  the  prefix  "sub*)  becomes 
operational  whan  the  controHer  calls  it  to  ardor.  Each 
operator,  on  joining  tha  network,  communicates  to  the 
controHor  the  numbar  of  messages  the  operator  has  to 
transmit,  and  to  whom.  The  controller  matches  up 
operators  and  assigns  thorn  to  a  nearby  (usually  within 
10  or  15  kilohertz)  frequency  to  conduct  their  business. 
After  the  two  members  complete  thair  communication, 
each  reports  back  to  tha  controller,  indicating  what 
traffic  was  transmitted  ss  a  chock  of  completeness.  At 
any  time  after  being  dispatched,  a  pair  of  operators  may 
select  a  now  frequency  in  a  dearer  area  of  the  radio 
spectrum. 

All  negotiation  between  operators  and  control! or  is 
accomplished  with  tha  aid  of  a  shorthand  network 
protocol.  Often  the  protocol  is  not  follow od  precisely 
[10].  Tha  protocol  language  is  called  "chatter,'  and  it  is 
composed  of  words  whoso  generic  type  is  "Q-sign,"  "Pro- 
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Company  in  Bedford,  Now  Hampshire;  Mr.  Timothy 
Andarson  by  Computer  Corporation  of  America  in 
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Bersnek  and  Newman  in  Cambridge,  Massachusetts;  and 
Mr.  David  Sherry  by  the  Xerox  Palo  Alto  Research 
Center  at  Xerox  Parc,  Psio  Alto,  California 


sign*  or  "Call-sign."  The  vocabulary  of  chatter  includes 
about  100  aaaantiat  words  and  aa  many  aa  900  others. 
Tha  chattar  language  allows  an  oparator  to  make 
statements,  ask  questions,  and  give  orders.  It  provides  a 
means  for  statement  and  query  of  operator  identification, 
signal  characteristics,  rendezvous  information,  message 
traffic  information,  and  so  forth.  Chattar  contains  an 
error  recovery  procedure  for  obtaining  retransmission  of 
either  a  word,  the  first  character  of  each  word  in  the 
message,  everything  after  or  before  a  particular  word  or 
phrase  in  the  message,  etc.  The  traffic  that  is 
communicated  between  Morse-code  operators  consists  of 
messages,  each  comprised  of  a  header,  body,  and 
signature.  Header  information  typically  contains  from 
whom,  to  whom,  handling  instructions,  precedence, 
number  of  words  in  the  message,  and  other  items. 


1.  The  Radio  Domain 

There  are  a  number  of  attributes  of  Morse  code 
signals  which  we  call  radio  domain  attributes.  Some  of 
these  attributes  help  but  many  hinder  an  operator’s 
ability  to  translate  Morse  coda.  Unfortunately,  even 
those  attributes  which  helped  operators  perform 
translation  heretofore  were  a  bane  to  machine 
translators.  Transmitter  characteristics  such  as  chirp  are 
used  to  good  advantage  by  an  operator  to  separate  a 
wanted  signal  from  interference  Not  only  did  machine 
translators  not  take  advantage  of  such  transmitter 
characteristics  but  such  characteristics  hindered  the 
demodulation  process  to  the  extent  that  errors  were 
introduced  into  the  translation  because  of  the 
characteristics  In  addition  to  errors  introduced  by  the 
demodulator’s  inability  to  handle  transmitter 
idiosyncrasies  the  atmosphere  adds  its  own 
characteristics  which  interfere  with  the  Morse  code 
signal;  for  example  shot  noise,  fading,  multipath,  etc 
make  translation  difficult  for  operators  and  machines 
alike. 

(The  current  Morse  code  project  did  consider  the 
problem  of  transmitter  characteristics;  but  unfortunately, 
because  of  some  contractual  restrictions,  was  prevented 
from  dealing  with  atmospheric  characteristics.) 


Sender  Induced  Irra 


Letter-spaces  are  used  to  separate  adjacent  letters,  and 
their  ideal  duration  is  three  times  that  of  a  dot  Word- 
spaces  are  used  to  separate  adjacent  words,  and  their 
ideal  duration  is  seven  times  that  of  a  dot 

The  precision  and  correctness  of  manual  Morse 
code  is  tar  from  ideal,  even  if  problems  associated  with 
the  radio  domain  are  omittod.  Sanders  themselves 
induce  three  types  of  irregularities:  (i)  spacing -errors, 
<ii>  mark-errors  and  (iii)  spelling-errors.  More  then 
ninety  percent  of  all  sender  induced  irregularities  in 
hand-sent  Morse  code  are  spacing-errors.  Such 
irregularities  occur  when  a  sender  does  not  keep  the 
proper  ratios  between  merk-epacee,  letter-spaces,  and 
word-spaces.  The  result  is  analogous  to  spoken  language 
that  is  slurred  or  broken  by  arbitrary  pauses.  The 
segmentation  problem  in  manual  Morse  code  is  analogous 
to  the  segmentation  problem  in  continuous  speech. 
Irregular  spacing  makes  it  a  difficult  problem.  Hand-sent 
Morse  code  of  plain  English  text  often  has  a  spacing 
error  in  each  word  of  the  massage  and  some  words 
(typically  the  long  ones)  may  contain  several  spacing 
errors.  The  next  most  frequent  sender  induced 
irregularity  is  a  mark-error.  A  mark-error  occurs  when  a 
sender  omits,  adds  or  changes  the  sense  of  one  or  more 
of  the  marks  (dots  and  dashes)  making  up  a  word.  Our 
experience  indicates  that  as  many  as  twelve  percent  of 
the  words  in  a  Morse  code  message  may  contain  mark- 
errors.  Spelling-errors  which  can  be  so  classified  occur 
moat  infrequently.  Certainly  lass  than  one  percent  of  the 
errors  are  classified  as  belonging  to  this  type,  meinfy 
because  senders  know  the  chatter  language  very  well 
and  because  many  common  English  spelling  errors  mop 
directly  onto  a  spacing-error  or  a  simple  mark-error, 
making  spelling-errors  indistinguishable  from  these  other 
errors. 

Needless  to  say,  sender  induced  irregularities  in 
manual  Morse  coda  proliferate  to  the  extant  that  early 
machine  algorithms  such  as  MAUDE  often  produced 
translations  that  could  be  read  only  with  greet  difficulty, 
especially  if  the  reader  had  no  knowledge  of  Morse  code. 
Yet  operators  have  little  or  no  difficulty  coping  with  the 
irregularities  resulting  from  operator  lapses,  radio  noise 
and  interference  and  can  translate  a  manual  Morse  code 
signal  on  which  moving-threshold  translators  such  as 
MAUOE  would  produce  hopelessly  garbled  results. 


Morse  code  is  composed  of  a  sequence  of 
alternating  marks  and  spaces.  In  on-off  keyed  Morse 
code,  a  mark  is  characterized  by  the  presence  of  an 
audible  tone  for  some  interval  of  time  and  a  space  by  its 
absence  for  some  interval  of  time.  There  are  two  types 
of  marks:  dots  and  dashes.  Ideally,  the  duration  of  a 
dash  is  three  times  that  of  a  dot  Thera  are  three  types 
of  spaces.  They  are  given  various  names.  Here  we  shall 
call  them  mark-space,  letter-space  and  word-space. 
Mark-spaces  are  used  to  separate  marks  of  a  single 
character,  and  their  ideal  duration  is  that  of  a  dot. 


It  is  clear  that  good  Morse  operators  have 
conceptual  models  of  the  Morse  code  environment  that 
they  use  to  help  them  perform  their  task.  They  heve 
models  of  Morse  "sounds"  —  sequences  of  dots  and 
dashes  with  rhythm  and  timing  information  —  and  msp 
these  sounds  into  the  letters  and  words.  They  have 
models  of  the  langusge  constructs  that  are  used,  be  they 
English,  another  natural  langusge,  or  the  ehatter  language. 
Operators  form  models  of  other  operators'  idiosyncratic 


mannerisms  and  us*  these  medals  in  the  translation  and 
understanding  processes  and  in  identifying  othar 
oparators.  Oparatora  also  haw*  models  of  tha  Mors* 
coda  and  radio  domain*,  it  i*  common  knowiodfa  that 
"TH"  it  oftan  sant  with  a  short  spaca  batwaan  tha 
letters,  so  that  a  machina  oftan  intarprat*  it  as  *6*  (thus 
*6E“  is  rsaily  THE*).  Similarly,  'AN*  is  oftan  intarpratad 
a*  "P*  (thus  TO*  is  rsaily  *ANO*V  In  tha  radio  domain, 
knowledge  about  whara  relevant  oparators  ara  in  tha 
frequency  spectrum,  what  a  particular  trananattsr  sounds 
like,  how  a  signal  fadaa  and  returns  —  aH  thaaa  form  tha 
modais  that  haip  oparators  idontify,  track,  tranacriba  and 
undarstand  ons  anothar.  It  is  tha  human  being*  ability  to 
intarprat  Morsa  sounds  in  tha  contaxt  of  such  mantai 
modais  that  allows  har  or  him  to  parform  so  wall 

At  this  point  a  slight  digression  is  in  ordar. 
Listening  to  a  Mors*  coda  conversation  among  a  group  of 
operators,  one  notices  three  distinct  aspects  of  tha 
conversation.  These  correspond  to  (i)  network  chatter, 
(ii)  massage  headers,  and  (iii)  massage  bodies.  Tha 
chatter  section  is  oftan  vary  poorly  sant. 
Characteristically,  many  letters  and  words  ara  slurred  or 
separated  and  corrupted  by  other  operator  lapses  Yat 
receiving  operators  have  little  difficulty  undarstanding 
chatter,  because  they  have  a  modal  of  the  global 
situation:  what  question  was  asked  by  whom,  who  is 
currently  waiting  on  tha  network,  who  has  massage 
traffic  for  whom,  and  so  forth  This  modal  and  the  ability 
to  undarstand  tha  conversation  is  vitally  important  to 
translation 


Headers  of  messages  ara  structured  but, 
unfortunately,  not  rigidly.  Again,  to  translate  them 
correctly  ana  must  have  soma  undarstanding  of  what 
headers  are  about  For  instance,  dates  may  be  sant  as 
’8  Dec  78"  or  "8  12  78“  or  *81278"  and  times  may  be 
sant  as  "1000Z"  or  "1000".  Wa  have  written  the  dates 
and  times  in  an  ideal  manner,  but,  in  fact  they  might  bo 
—  as  a  result  of  operator  lapses  —  segmented  quite 
differently,  so  that  parts  of  numbers  are  run  together  or 
a  number  is  split  apart  and  on*  or  mar*  numbers  might 
contain  a  mark-error.  One  other  aspect  of  numbers  is 
very  important  All  numbers  in  Morsa  cod*  ara  five 
marks  long  —  sa*  [11]  under  Morsa  cod*  —  yat  they 
are  often  abbreviated  and  sant  as  what  ara  caked  cut- 
number*.  Again,  context  is  often  required  to  parform 
translation  correctly. 

Tha  body  of  amateur  radio  message  traffic  is 
typically  English  with  tome  abbreviations.  To  attempt  to 
undarstand  all  of  the  English  language  would  b*  far 
beyond  tha  scop*  of  this  research.  Wa  have  built  into 
the  system  just  enough  knowledge  to  let  it  perform  in  a 
creditable  fashion.  Our  experience  indicates  that  a 
vocabulary,  soma  rules  about  where  numerals  can  occur 
in  taxt,  rules  about  how  to  hand)*  error  signs,  and  a 
measure  of  closeness  in  a  Hamming-like  spaca  (for 
correcting  operator  induced  irregularities)  are  absolutely 
essential  to  tha  correct  translation  of  plain  taxt  Given 


Figure  1.  Tha  Three  Major  Modules  of  the  Morsa 
Coda  System  and  the  Domain  Modais  They  Use. 

those  sine  qua  none,  our  experience  suggests  that 
know! edge  about  idiosyncratic  and  irregular  behavior  of 
individual  operators  facilitates  translation 

Military  massage  traffic  differs  slightly  because  tha 
body  may  consist  either  of  plain  text  or  of  cipher  groups. 
A  massage  sant  in  dphar  has  no  languaga  context  but 
knowledge  about  the  number  of  groups  in  a  massage,  tha 
number  of  characters  in  a  group,  and  whether  groups  are 
alphabetic,  numeric,  or  mixed  is  necessary  to  translate 
such  a  message. 

Figure  1  shews  a  block  diagram  of  tha  thro*  major 
module*  of  tha  Morsa  cod*  system  under  development 
by  tha  authors.  Also  shown  are  the  necessary  domain 
model*  required  by  each  module  in  ordar  for  it  to 
parform  its  task  properly.  The  wavy  line  in  tha  diagram 
indicates  that  the  signal  processing  system,  which  is 
composed  of  special  hardware  and  a  POP-1 1  computer,  is 
not  integrated  with  the  other  major  modules  which  are 
COMOEC,  the  transcription  (or  translation)  module,  and 
CATNIP,  the  chatter  and  header  understanding  module. 
The  last  two  are  software  modules  written  in  MOL  (a 
USP-lik*  language)  [12]  and  running  under  TCPS-20  [13] 
and  ITS  [14],  Experiments  ars  conducted  independently 


for  the  signal  processing  system,  and  human  intarvantion 
it  required  to  transtar  tha  rasulta  to  the  othar  two 
modules.  COMOEC  and  CATMP  arm  waU  integrated,  with 
appropriate  feedback,  and  externally  thay  appaar  to 
bahava  aa  ana  system. 

A  taw  phraaaa  about  aach  domain  modai  may 
prova  helpful: 

1.  Modai  of  tha  radio  domain  situation  —  how  tha 
individual  transmitters  of  intaraat  sound,  i'.a,  whathor 
a  transmit  tar  haa  any  charactaristie  anvaiopa  or 
carriar  distortions,  and  it  so  what  kind  and  a  maaaura 
of  tha  amounts 

2.  Modal  of  tha  Natwork  situation  —  which 
operators  arm  logged  into  tha  natwork,  which  arm  oft 
control  frequency,  which  are  on  control  frequency 
and  where  each  operator’s  transmitter  is  tuned 
relative  to  those  of  the  other  operators  on  his 
frequency.  This  last  bit  of  information  turns  out  to 
be  quite  important,  as  we  will  show  latar,  even 
though  ail  tha  operators  are  working  in  a  thirty  to 
fifty  Hertz  band 

3.  Models  of  senders  —  the  irregularities  a 
particular  sender  may  introduce,  such  as  his  or  her 
idiosyncrasies  of  language,  a  proclivity  to  introduce 
extraneous  dots  or  omit  dots,  etc. 

4.  Models  of  language  —  tha  full  gamut  of 
possibilities  are  required  for  parsing  and 
understanding  chatter,  but  wo  have  rather  simple 
models  for  handling  message  bodies  such  as  a 
vocabulary  and  some  simple  rules  for  handling  soma 
special  constructs  and  numbers. 

S  Models  of  the  situation  —  the  systam  must  know 
when  a  question  is  asked  and  the  post! bis  range  of 
expected  answers;  it  must  know  that  a  frequency 
change  has  been  ordered  or  negotiated  and  how  to 
respond  appropriately;  and  so  forth. 


a  THE  MORSE  CODE  SYSTEM 

As  mentioned,  the  Morse  code  systam  is  composed 
of  three  major  modules  and  soma  domain  models.  We 
will  attempt  in  this  section  to  present  a  short  explanation 
of  each  of  the  major  modules. 


1.  Sianai  Processing  System 

We  believe  that  an  interesting  and  important 
development  in  the  signal  processing  area  of  tha  Morse 
coda  project  was  tha  implementation  of  a  novel  tandem 
phase-lock-loop  filter  that  utilizes  time  reversal  of  the 
input  signal.  Despite  the  use  of  time  reversal,  tha  output 
can  be*  obtained  in  real  time,  albeit  with  a  constant  delay. 


Tha  nature  of  Morse-coda  signals  --  tha  tact  that  thay 
are  on-off  keying  or  frequency-shift  keying  —  and  tha 
fact  that  initial  experiments  indicated  that  tha  transient 
response  of  the  phase-lock -loop  filter  interfered  with 
tha  measurement  of  important  signal  parameters,  lad  to 
tha  development  of  tha  novel  filter. 

There  is  a  greet  deal  of  information  contained  in 
the  audio  sound  of  a  Morse-code  signal  —  the  signal 
characteristics  per  se  —  besides  the  timing  information 
of  tha  marks  and  spaces  It  became  dear  while  running 
soma  experiments  in  indarstandmg  Morse  coda  network 
conversations  that  the  signal  characteristics  contained 
information  that  was  an  important  part  of  tha  context  of 
tha  situation;  it  is  necessary  to  extract  this  information 
in  order  to  understand  tha  natwork  conversation  <q.v.). 

A  small  digression  is  required  to  explain  what  a 
human  operator  hears  in  the  sound  quality  of  the  signal  in 
order  to  understand  what  must  be  extracted  from  Morse 
eo da  signals  Briefly,  a  human  operator  is  capable  of 
detecting  and  tracking  cartain  signals  in  a  crowded 
spectrum  of  similar  competing  signals  Note  how  one  can 
follow  a  particular  conversation  at  a  crowded  cocktail 
party.  One  can  do  fairly  well  even  with  one  ear.  There 
is  no  binaural  effect  in  the  Morse-code  domain.  This 
discriminating  ability  of  human  beings  is  evidently 
knowledge-based.  To  discriminate  signals,  an  operator 
uses  information  about  how  the  signal  sounds:  (a)  its 
frequency;  (b)  its  anticipated  frequency  drift;  (c)  its 
amplitude  an d  rate  of  chirp,  if  any;  and  fd)  tha  amount  of 
envelope  distortion  such  as  hum,  clicks,  yoop  and 
whatever  other  characteristics  of  the  waveform  can  be 
characterized.  A  good  signal-processing  front  end  should 
be  capable  of  measuring  soma,  if  not  all,  of  the  above 
signal  characteristics  and  of  using  the  measured 
characteristics  for  signal  discrimination 


a.  Tandem  Phase-t-ock-Loop 

The  general  requirements  can  be  translated  into 
specific  requirements  of  a  receiving  filter  process  for  the 
Morse-code  application  (The  specific  filter  design  is  for 
an  on-off  keyed  signal,  and  experiments  ware  conducted 
only  with  such  a  signal.  Therefore,  the  discussion  that 
follows  is  in  the  context  of  on-off  keyed  signals. 
However,  it  should  be  pointed  out  that  similar  arguments 
can  be  made,  and  similar  results  can  surely  be  obtained, 
for  tha  case  of  frequency-shift  keyed  signals.) 
Extracting  tha  on-off  timing  information  for  marks  and 
spaces  as  well  as  signal  quality  information  requires 
determination  of  the  transitions  of  tha  signal  as  well  as 
continuous  estimation  of  the  amplitude  and  frequency  of 
the  signal.  The  latter  information  serves  a  dual  purpose 
First,  it  is  used  to  characterize  transmitter  signals  for 
use  in  transmitter  recognition.  In  addition,  the  frequency 
on  which  a  station  is  transmitting  is  part  of  tha  situation 
model,  and  an  uncharacteristic  frequency  shift  of  tan  or 
several  tans  of  hertz  often  indicates  a  change  of 


Frequency 


Figure  2.  Pheee-Lock-Loop  Detector. 


‘Mndtr*  during  network  conversation  This  type  of  cue 
is  extremely  useful,  because,  after  contact  has  bean  well 
established,  operators  often  do  not  re-identify 
themselves.  Furthermore,  without  the  change  of  sender 
context  information,  some  chatter  constructs  are 
ambiguous 

The  tracking-filter  model  of  the  phase-locked  loop 
(PLL),  Figure  2  [15,  16],  is  waM  stated  to  extracting  the 
information  indicated  above  from  a  signal  in  that  it  gives 
continuous  frequency  and  amplitude  estimates,  and 
presents  a  relatively  narrowband  filter  to  a  frequency- 
modulated  carrier. 

However,  before  a  PLL  can  give  accurate 
demodulation,  it  must  achieve  lock.  (Lock  is  the  state  of 
the  PLL  when  the  voltage  controlled  oscillator  (VCQ) 
tracks  the  incoming  signal  with  a  constant  phase  lag.) 
The  time  to  achieve  lock  is  inversely  proportional  to  the 
natural  frequency  of  the  loop,  and  is  affected  by  such 
factors  as  the  initial  frequency  error  (difference  in 
frequency  between  the  VCQ  and  the  input)  and  noise  in 
the  loop. 

Chirp  is  frequency  modulation  which  frequently 
occurs  in  low  quality  transmitters.  It  is  oftsn  caused  by 
inadequate  filtering  of  the  power  supply,  which  causes 
the  oscillator  to  change  frequency  when  the  power  stage 
is  turned  on.  Thus,  the  frequency  modulation,  or  chirp, 
exists  where  the  signal  makes  a  transition  from  off  to  on, 
or  vice  versa.  Most  often,  it  exists  only  at  the  beginning 
of  the  "on*  period  or  ’mark"  of  the  Morse-code  signal. 


Unfortunately,  in  a  traditional  PLL  arrangement,  or  for 
that  matter  any  type  of  traditional  filtering;  the  transient 
response  of  the  filter  is  superimposed  on  the  signal. and 
is  largest  at  the  signal  transition  points  The  problem  is 
exacerbated  when  interference  is  considered;  as  one 
narrows  the  bandwidth  of  the  filter  to  eliminate  the 
interfering  signals,  the  period  for  which  the  filter 
transient  response  is  a  significant  factor  in  the  output  is 
lengthened.  In  the  case  of  the  PLL,  the  transient 
between  acquisition  and  lock  at  the  beginning  of  the 
signal  is  the  major  one,  because  a  PLL  will  track  the 
signal  <*jring  the  on-to-off  transition  until  it  reaches  a 
signel-to-noise  ratio  at  which  the  signal  is  lost  Thus, 
because  the  frequency  and  amplitude  estimate  of  the 
signal  prior  to  lock  contains  important  information,  i.e,  the 
chirp  information  and  the  time  at  which  the  mark  began,  it 
is  desirable  to  reconstruct  that  portion  of  the  signal 

A  number  of  ways  of  recovering  the  pre-lock 
information  can  be  conceived  The  method  settled  upon 
is  simple.  It  involves  sampling  and  storing  the  input  to 
the  PLL,  and  than,  after  the  PLL  has  completed 
processing  the  mark  in  the  forward  direction,  sending  the 
stored  samples  in  reverse  order  through  the  loop.  The 
loop  then  demodulates  a  time-reversed  replica  of  the 
original  signal,  and  the  original  leading-edge  information  is 
reliably  obtained  from  the  trailing  edge  of  the  reversed 
signal. 

Because  it  was  desirable  to  run  the  process  in 
real  time,  only  the  beginning  portion  of  the  signal  is 
reversed  and  a  second  PLL  is  used  to  demodulate  it  so 


Figure  3.  Tandom  Phase  Lock-Loop  Detection  System  with  Wav*  Form  Reconstruction. 


that  tha  first  PU.  can  cantinua  to  demodulate  tha  forward 
signal.  In  addition,  tha  ravarsad  signal  can  bo 
comprassad  in  tima  by  sanding  tha  ravarsa-ordar 
sampias  through  tha  sacondary  PU.  at  a  rata  faatar  than 
thay  warn  collected.  Of  course,  tha  sacondary  PU. 
naads  to  run  at  a  higher  fraquancy.  Thus,  it  is  poaaibia 
to  hava  reconstructed  tha  mark  bafora  tha  naxt  mark 
begins. 

Tha  tandem  phase-lock-loop  system  is  shown  in 
Figure  3. 

Tha  input  is  sampled  and  stored  in  a  last-in-first-out 
(UFO)  memory.  Meanwhile  tha  input  to  tha  sacondary 
PU.  is  taken  from  tha  quadratics  phase  of  tha  primary 
PU.  Whan  lock  is  indicated  (by  tha  quadrature,  or 
correlation  output  of  tha  primary  PU)  tha  input  of  tha 
sacondary  PU.  is  switched  to  tha  digital-to-onalog 
converter  (0AC5,  and  tha  stored  samples  are  read  out  in 
reverse  order.  Tha  outputs  of  tha  sacondary  PU  are 
taken  as  tha  demodulated  signal  for  tha  tima  prior  to 
lock.  Tha  outputs  of  both  PU’s  are  sampled  and  a  POP- 
1 1  program  reconstructs  tha  waveform  (mark). 
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2.  COMOSC;  The  Morse  coda  Translation  System 

Tha  CQMDEC  (computerized  Morse  Decoder) 
control  structure  was  designed  to  consist  of  any  number 
of  modules,  each  of  which  would  ba  an  'expert*  on  one 
aspect  of  translation.  Each  module  can  add  suggested 
translations  to  a  lattice  of  possible  translations. 
Successive  modules  decide  whether  further  error 
correction  is  necessary  by  examining  tha  quality  of 
existing  suggested  translations  in  tha  lattice,  than  pass 
that  section  of  coda  to  tha  naxt  module  in  tha  chain,  and 
so  on,  until  each  translator  module  has  examined  tha 
entire  massage.  A  major  part  of  tha  design  is  an  N- 
di mansions!  metric  for  measuring  trial  translations  and  a 
heuristic  algorithm  for  comparing  tha  measures. 

Translator  modules  are  ordered  approximately  in 
terms  of  increasing  severity  of  the  sanding  errors  they 
are  able  to  correct  Thus,  spacing-errors  are  corrected 
first  Later,  mark-errors  are  corrected. 

Tha  first  module  to  process  a  code  samoie  is  a 
moving-average  translator,  a  hybrid  of  MAUDE  [2]  and 
FRAUD  f 71.  It  classifies  marks  and  spaces  into  tha 


appropriate  typaa  by  comparison  with  continuously 
updated  averafes  tor  each  type  and  thresholds  between 
types.  More  importantly,  it  aaaodatea  with  each  mark 
and  space  a  number  which  represents  its  confidence  that 
the  classification  was  correct  These  assignments  and 
confidences  are  used  by  later  modules  to  select  arose  of 

the  sample  where  errors  appear  to  hove  occur od 


a.  Handing  the  Segmentation  Problem 

The  performance  of  moving  threshold  Morse  code 
translation  systems  suffered  quite  severely  from  the 
effects  of  improper  segmentation  of  Morse  code 
sequences.  Morse  code  translators  suffer  from  two 
types  of  segmentation  difficulties:  first,  character-spaces 
and  mark-spaces  are  confused  such  that  two  characters 
are  sometimes  decoded  as  one,  eg,  ’AAf  as  ’P‘,  and  vice 
versa;  second,  word-spaces  and  character-spaces  are 
confused  such  that  ’THEY  ARC  is  translated  aa  ’THEY ARC 
and  HOLD’  is  translated  as  ’H  01  0*.  The  0EGAR8LER 
[17],  a  program  based  on  the  use  of  digraphs  and 
trigraphs  found  in  English  words  and  a  dictionary  of 
words,  was  capable  of  correcting  many  of  the  word- 
segmentation  errors  that  occurred  in  MAUOE  output,  but 
it  could  not  obtain  the  proper  transcription  at  Morse 
sequencea  that  had  character-segmentation  errors 
present 

Because  words  corrupted  only  by  segmentation 
errors  contain  a  correct  sequence  of  marks,  we  realized 
that  words  can  be  represented  by  the  *run-length 
sequence*  (RLS)  of  the  marks.  The  run-length  sequence 
is  a  string  of  numbers,  composed  of  the  number  of 
consecutive  dots  which  begin  the  word,  followed  by  the 
number  of  consecutive  dashes  which  follow  those  data, 
and  so  on.  (By  convention,  a  run-length  sequence  always 
begins  with  the  number  of  dots,  aven  if  the  number  is  0.) 
The  principle  is  simple  All  mark  and  character  spaces 
are  removed  from  the  representation  Because  spacing 
errors  occur  frequently  —  one  or  more  per  word  —  a 
major  source  of  difficulty  is  removed.  BamcaUy,  the  RLS 
is  a  mapping  of  words  in  the  Morse-code  domain  onto 
some  space  Each  word  along  with  its  variants,  obtained 
by  taking  ail  possible  combinations  of  character 
segmentation  errors,  are  mapped  to  the  same  point  in 
that  space 

The  reverse  mapping  of  run-length  sequence  onto 
legal  words  is  not  necessarily  usque  However,  better 
than  8SX  of  the  RLS  for  a  dictionary  of  over  10,000 
words  are  unique  and  most  of  those  that  are  ambiguous 
hav«  but  a  few  members  in  their  ambiguity  sete  The 
ambiguity  resulting  from  representing  words  by  their  RLS 
is  not  a  significant  problem,  because  often  the  simple 
heuristic  of  comparing  all  the  words  with  a  specific  RLS 
to  the  MAUOE  transcription  of  that  segment  of  Morse 
code  is  sufficient  to  determine  the  one  the  sender 
intended 


To  understand  how  the  RLS  is  used,  consider  the 
following.  The  sequence  of  dots  and  daahea  for  the  word 
'the*  was  sent  with  spaces  such  that  a  context- 
insensitive  decoder  could  not  transcribe  it  into  a  proper 
word,  i.a.,  the  sequence  sounds  like  ’6ET  A  general 
heuristic  that  one  might  think  of  adopting  is  this:  find  aM 
the  possible  words  which  that  sequence  of  marka  could 
represent  and  pick  the  one  that  seems  to  fit  best 
according  to  some  criterion.  If  one  tried  to  implement 
such  a  heuristic  in  a  brute-force  manner,  2**5  or  32 
posaibiiities  would  need  to  be  considered,  aa  the  word 
'the*  {-  _ .)  has  three  mark  spaces  snd  two  character 
spaces  for  a  total  of  five  spaces.  Thus  32  permutations 
and  look-ups  would  be  required  to  find  ’the*  and  any 
other  words  with  that  mark  sequence.  On  the  other 
hand,  using  the  RLS  representation  with  no  permutation* 
and  one  look-up,  *tha*  and  ail  other  legal  wards  with  that 
RLS  are  returned.  Consider  the  word  "complex",  not  an 
unusual  or  overly  long  English  word.  If  it  were  sent  with 
a  spacing  error,  and  a  brute-force  method  were  used  to 
obtain  possible  candidate  words,  the  possible  space* 
associated  with  the  sequence  of  dots  and  dashes  would 
require  more  than  two  million  (2**21)  permutations  and 
look-ups,  only  to  discover  that  in  the  dictionary  currently 
in  us*  only  the  word  ’complex*  could  be  represented  by 
such  a  sequence  of  marks.  Again,  the  RLS  representation 
would  return  the  word  ’complex*  in  one  look-upi 

The  use  of  the  RLS  solves  only  the  character 
segmentation  problem.  Word  segmentation  is  handled  by 
a  heuristic  that  splits  or  joins  character  sequence*.  The 
heuristic  uses  the  confidence  levels  returned  by  the 
MAUOE  decoder  to  determine  how  to  proceed.  (For 
more  details  on  RLS  translators,  sae  [18,  19,  20]). 


b.  Handling  Mark  Errors 

The  mark-error  correction  modules  are  able  to 
correct  eight  different  classes  of  mark-error  in  words 
which  contain  at  most  on*  mark  error.  Statistically,  the 
vast  majority  of  mark  errors  are  of  the  types  COMOEC 
can  correct,  and  occur  only  one  to  a  word.  The  types  of 
mark  errors  corrected  by  COMOEC  are  (I)  sending  an 
extra  dot,  (ii)  sending  an  extra  dash,  (iii)  sending  two 
extra  dots,  (lv>  running  two  dots  together  as  a  dash,  (v) 
splitting  a  dash  into  two  dots,  (vi)  dropping  a  dot,  (vii) 
dropping  a  dash,  and  (vii)  dropping  two  dote 

These  mark-errors  are  easily  correctable  because 
words  with  these  classes  of  mark  errors  map  to  nearby 
points  in  the  RLS  space  and  the  space  is  very  sparsely 
populated  for  containing  eleven  marks  or  more. 


c.  Handling  Error  Signs  and  Number  Constructs 

When  a  sender  recognize*  that  he  or  she  ha*  made 
an  error,  he  or  she  will  resend  the  erroneous  word, 
phrase,  or  sentence,  signalling  with  an  error  sign  that 


retransmission  is  about  to  occur.  This  behavior  is 
somewhat  anaiofous  to  a  typist  who  spaces  back  over  an 
error  and  overstrikes  it  with  X’s.  An  error  si(n  is  usually 
a  sequence  of  dots  sent  rapidly,  rarefy  fewer  than  six, 
and  rarely  more  than  twenty.  The  number  of  dots  sent 
varies  even  within  a  single  transmission,  as  does  the 
separation  of  the  dot-sequence  from  the  erroneous  code 
preceding  it  and  the  "correct"  code  following  it  More 
importantly,  the  semantics  of  an  error  sign  vary  even 
more  widely.  Thus  it  is  necessary  to  locate  and  to 
ascertain  the  meaning  of  an  arror  sign.  An  error  sign 
may  mean  to  ignore  the  previous  ward  or  characters,  or 
it  may  mean  that  the  previous  word  or  phrase  will  be 
resent,  and  so  on  Some  examples  from  actual  code  tenth 
the  symbol  "9"  used  to  represent  an  error  sign)  follow: 


must  be  allowed,  as  ail  possible  contexts  have  not  been 
or  cannot  practically  be  implemented.  If  a  number 
appears  out  of  the  contexts  in  which  a  number  is 
expected,  it  is  looked  upon  by  COMOEC  with  suspicion, 
and  it  will  be  allowed  to  remain  a  number  only  if  it  is 
relatively  well  sent 

COMOEC  searches  for  mark  sequences  that  fit 
these  criteria  and  then  attempts  to  'expand*  them  on 
either  side  (to  produce  complete  numbers).  The  only 
Imitation  on  this  algorithm  is  that  at  least  one  digit  of  an 
n-dlgit  number  must  be  sent  correctly. 


e.  The  COMOEC  Dictionaries 


ANY  8  OY  OR  GIRL  13  TO  10  •  13  UHO 

The  correct  translation: 

ANY  BOY  OR  GIRL  13  TO  13  WHO 

This  is  the  most  typical  use  of  an  error  sign  It  signals 
that  the  previous  word  or  object  was  in  error,  and  the 
sender  resends  the  word  correctly.  The  error  sign  in 
this  example  contained  thirteen  data. 
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This  is  similar  to  the  previous  example,  but  two  words 
are  erased  and  resenL  The  error  sign  contained  eleven 
dots. 

THE  CORNER  OF  UASHINGTQN 
BLVO  a  AND  SCHOOL  STREETS 

In  this  example,  the  word  "BLVO*  is  erased  — •  it  should 
not  have  been  sent  at  all.  The  arror  sign  contained 
seventeen  dots. 


d.  Handling  Numbers  in  Plain  Text 

COMOEC  recognizes  arbitrarily  long  sequences  of 
digits  as  'numbers'. 

The  problem  of  transcribing  numbers  is  analogous 
in  some  ways  to  that  of  transcribing  arror  signs,  and  it 
arises  from  the  fact  that  most  of  COMOEC’s  transcribing 
is  vocabulary-based.  Since  it  is  theoretically  possible  to 
send  a  number  containing  an  arbitrary  number  of  digits,  it 
is  impractical  to  use  a  'dictionary*  of  numbers.  Instead, 
COMOEC  takes  advantage  of  the  properties  of  the  Morse 
code  us  ad  to  represent  the  digits:  (i)  All  dgits  consist  of 
five  marks  (except  cut  numbers  which  are  also  handled), 
(ii)  Every  digits  contains  a  sequence  of  dots  followed  by 
a  sequence  of  dashes,  or  vice  versa  (iii)  A  number  very 
often  appears  in  context,  for  example,  as  a  part  of  a 
date,  time,  address,  page  number,  or  age  specification. 
This  context  is  used  to  reinforce  the  probability  that  the 
item  is  a  number.  A  number  appearing  out  of  context 


COMOEC’s  dictionary  for  plain  text  messages  is 
comprised  of  4200  root  words.  A  morphology  program 
embedded  in  the  translator  handles  a  large  but  not 
complete  set  of  word  endings.  Thus,  a  word  usually 
appears  in  the  dictionary  only  in  its  root  form.  The 
current  morphology  program  understands  the  endings  "- 
s,*  "-ed,"  '-ing,*  '-er,'  '-est,'  *-ly,'  '-lion,'  '-ment,*  and 
combinations  of  the  preceding,  such  as  *-ers.*  The 
dictionary  specifies  the  endings  taken  by  each  word. 
When  endings  are  considered,  the  effective  size  (about 
18,000  words)  of  the  dictionary  is  several  times  the 
number  of  its  roots  and  it  approaches  the  size  of  the 
conversational  vocabulary  of  the  average  English 
speaker. 


CATNIP  Chatter  and  Header  Understanding  System 
CATNIP  is  a  semantic-syntactic  augmented- 


transiti on-network  (ATN)  parser  that  chooses  a  path 
through  the  lattice  of  possible  translations  created  by 
COMOEC. 


CATNIP  uses  ATN  diagrams  to  choose  the  correct 
word  from  a  lattice  of  possible  translations.  It  starts  in  a 
certain  state  of  the  transition  network,  and  progresses 
from  one  state  to  another,  depending  an  the  next  word 
or  words  in  the  lattice.  With  each  state  is  associated  a 
list  of  words,  and  with  each  word  a  new  state.  CATNIP 
matches  the  list  of  words  from  the  state  with  the  list  of 
words  possible  at  that  point  in  the  translation  lattice, 
matches  yield  valid  new  states. 


If  that  were  all,  the  network  would  simply  be  an 
unaugmented  transition  network.  However,  CATNIP 
retains  a  context,  which  it  changes  (usually  with  every 
word)  and  which  can  be  tested  when  it  is  trying  to  match 
the  words.  The  context  indudes  such  things  as  who  is 
the  sender  of  the  current  transmission,  who  is  the 
receiver,  who  is  the  net  controller,  and  so  on.  AThTs,  as 
opposed  to  unaugmented  transition  networks,  are  good 
for  parsing  grammars  that  are  dependent  on  the  context 
and  on  past  occurrences  [21 J. 
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Naturally,  ambipjtia*  creep  in.  Sometimes  mare 
than  one  match  is  possible;  CATNIP  allows  for  this  by 
processing  one  of  the  new  valid  states  and  saving  all  the 
others.  The  context  at  that  point  is  saved  with  the 
states  thst  were  saved.  CATNIP  has  the  ability  to  return 
to  the  saved  states  and  try  those  alternate  paths. 

Finally,  CATNIP  also  has  a  limited  understanding  of 
the  events  on  the  net  Understanding  these  events  is 
important  in  understanding  the  state  of  the  net  at  any 
point  (how  many  operators  are  working,  who  they  are, 
who  is  talking,  etc)  and  it  is  important  in  choosing  the 
correct  word  at  a  particular  point  in  the  translation. 

The  context  is  used  as  the  'understanding"  part  of 
CATNIP.  Take  the  following  transmission  as  an  example: 
"ROCK  ROCK  ROCK  OE  SALT  SALT  QSA  ?  QRK  ?  K“. 
Upon  completion  of  the  parse,  the  parser  would  retain  a 
context  that  contained  the  information  that  the  receiver 
was  ROCK,  the  sender  was  SALT,  and  SALT  had  asked 
ROCK  two  questions:  "What  is  my  signal  strength?"  and 
"What  is  my  intelligibility?" 

Retaining  this  kind  of  context  helps  find  the  right 
transition  and  decide  later  ambiguities  (such  as  who  is 


the  receiver  at  a  certain  paint,  if  he  or  she  was  not 
explicitly  named).  The  successive  contexts  also  furnish  a 
synopsis  of  the  entire  session  after  the  parser  is 
finished. 

CATNIP  is  a  recursive  procedure  thst  allows  one 
to  name  ATN  diagrams  of  simple  structures  (such  as  Q- 
signs  that  are  often  used),  and  to  use  those  as  parts  of 
other  diagrams  without  actually  duplicating  the  simple 
diagrams.  Thus  a  more  structured  "grammar’  can  be 
created  without  over  complicating  the  data  base. 

Figure  4  shows  a  typical  ATN  diagram.  Each 
italicized  ARC  label  such  as  "Header’  indicates  a  call  to 
another  diagram:  in  lower  caaa  are  labels  such  as 
"location",  which  indicate  that  the  labeled  input  that  will 
parse  is  a  location  (such  as  "BOSTON"  or  "BOSH;  arcs 
labeled  with  a  number  sign  mean  that  a  number  is 
acceptable  as  input  (with  tha  parenthesized  statement 
indicating  tha  meaning  of  the  number,  eg,  (nr  -  gr)  means 
number  of  groups);  and  arcs  labeled  in  upper  case  mean 
thst  literal  input  of  one  of  the  specified  labels  is 
acceptable.  The  system  currently  contains  about  25 
diagrams  with  an  average  complexity  of  the  one  shown  in 
Figure  4.  • 
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E.  RESULTS 


previously:  error-signs,  numbers  in  plain  text  and  many 
different  types  of  chatter  constructs. 


Two  sources  of  test  data  were  used  We  obtained 

approximately  30  thousand  characters  worth  of  data  from  We  measure  the  performance  of  our  Morse  code 

amateur  operators  and  others  who  could  send  and  system  in  two  ways.  First  we  compare  the  final  results 

receive  Morse  coda  This  data  was  hand-keyed  in  our  with  the  results  produced  by  the  first  stage  of  our 

laboratory.  Some  of  this  data  was  obtained  using  actual  translator,  which  is  a  MAUOE-like  moving  threshold 

transmitters  connected  to  our  cable  network  [22].  This  translator.  Second  we  compare  the  final  results  with  an 

data  was  composed  primarily  of  plain  text  with  a  small  experienced  operator’s  translation.  The  experienced 

admixture  of  network  chatter.  Chief  Warrant  Officer  operator  had  the  use  of  a  variable  speed  tape  recorder. 

Robert  Soiling  at  the  Army’s  Morse  code  school  at  Fort  He  at  times  found  it  necessary  to  replay  parts  of  the 

□evens  arranged  to  have  several  instructors  at  that  tape,  occasionally  at  slower  speeds, 

school  simulate  an  actual  network  environment  From 

that  simulated  environment  we  received  audio  tapes  figure  5a  shows  a  translation  by  a  MAUDE-like 

containing  approximately  20  thousand  characters  of  hand-  translator  of  one  of  the  plain  text  messages  contained  in 

sent  Morse  code.  This  data  was  composed  of  chatter,  the  Fort  Qevens  data.  While  it  can  be  read,  especially 

headers,  plain  text  messages  and  cipher  group  messages.  by  a  Parson  who  knows  Morse  code,  a  considerable 

It  was  the  most  useful  data  in  that  it  forced  us  to  face  »<f°rt  '*  required.  Figure  6a  shows  the  same  message  as 

many  problems  and  constructs  we  had  not  encountered  translated  by  COMOEC,  our  Morse  code  system’s 


IN  ANATT  E  MPT  TOALLEVE  EE  1 . 1  ALLE4IATE  YOU  TV  UN  E  HPLOYMENT  I  N  THE  Cl 

TY  HIM  T  1 . 1  Cl'TY.  TSE  SUNOAYGLOBE  ON  HAY30  LULL  PUBLISS  FRE  E 

ADVERT  IS  E  nENTS  FOR  SOSTON  TEE  NAGERS  SEE  KING  SUnflER  JOBS.  ANY  B  OYORGIRL 

13T0  10  1 . 113  USOLIVES  IN  BCSTQNCANP  LACE  AJOG  UANTED  AD  UI  THOUT 

CAAEEEEEEEEEEEEEEE  CHdGE  SYFILLING  OU  T  THE  C( — ..-I PON  ON  PAGE  B  23  OF  TODAYS 

PETPERt . 1  ( . 1  TODAYSPAPER  ANOMAILING  I  T  TO  SUttnER  JOBS  (-..-I 

TSE  BOS  I - IN  GLOBE  ,  BOSTON  .  HAS  S  AC  S  EEEEEEEEEEE  nAS  SAC5USE  T  TS  02107. 

TE  E  N  (. .  I ERS  HAY  ALSO  TAKE  TS  E  COUPONS  T  0  TSE  GLOBES  OOUNTOUN  OFFICE  .  A 

r  TSE  C  ORNER  OF  WASHINGTON  SLVO  EEEEEEEEEEEEEEEEE  AND  SCH  DDL  STRE  E 

TS.ORATITS  HA  I  N  OF  FICE  .  13  ( . !  HORRISS  E  Y  BLVO.  00RC4  E  STER  A 

COUPTANS  I— ..-IS  T  B  E  RE  CEIVEO  (- . 1Y  EEEI  PH  . WEDNESDAY  Ml..  — I 

HAY  T  EEEEEEEEEEE  .  nAY2T5.J0BSE  E  KI  EEEEEEEEEEE  JOBSEEKERS  HAY  BE 

&SPECIFICAS  TH  E  Y  L  IKE  I  N  HE’  NTIONING  T5E  SO  I.  .-.-.)S  OR  DAYS  T5EY  AR  E 


AVAILABTIIEI . )  AVAILABLE  FOR  E  HPLOYMENT,  THE  TYPE  OF  UTMRK  6  E  YDES  IRE 

( - -  - .  I  C ANDO  I .-.1UV  AT  U(. - !  E  S  TH  E  Y  E  XPECT.  THE 


{.  — . .  1  SUILLAPPE/fINTSE  HAY30CLASS  IFII  EEEEEEEEEEEEEEEET  CLSSISE  I  E  D  SECTION 
UNDER  EH  E  EHE  (. — ..)  ING  SI  EEEEEEEEE  HEAOING  S  IRE  ABQSTON  TEE  NAGE  RFOR  TH 
E  sunrtER  . 

Figure  5a  MAUDE  Translation  of  Plain  Text 


IN  AN  ATTEHPT  TO  Cxxxxx  a]  <ALLEVIATE>  xY0UTH>  UNEnPLQYHENT  IN  THE  Cxxxxx  a] 
CITY  .  <THE>  SUNDAY  GLOBE  ON  HAY  30  UILL  <PU8LISH>  FREE  ADVERTISEMENTS  FOR 
<80ST0N>  TEENAGERS  SEEKING  SUMMER  JOBS  .  ANY  BOY  OR  GIRL  13  TO  Cxxxxx  ml  13 
<UHQ>  LIVES  IN  BOSTON  CAN  PLACE  A  <J0B>  UANTED  AO  UI  THOUT  Ixxxxx  «]  CHARGE  <BY> 
FILLING  OUT  THE  COUPON  ON  PAGE  B  23  OF  Ixxxxx  al  TODAYS  PAPER  AND  HAILING  IT  TQ 
SUMMER  JOBS  <.>  <THE>  BOSTON  GLOBE  .  BOSTON  .  Cxxxxx  a]  MASSACHUSETTS:*  02107  . 
TEENAGERS  HAY  ALSO  TAKE  <THE>  COUPONS  TO  <THE>  GLOBES  OOUNTOUN  OFFICE  .  AT 
<THE>  CORNER  OF  UASHINGTON  <BLV0>  Cel  AND  SCHOOL  STREETS  .  OR  AT  ITS  HAIN 
OFFICE  .  1 35  HORR ISSEY  BLVO  .  <00RCHESTER>  .  <C0UP0NS>  HUST  BE  RECEIVED  THEY  S 
Pn  .  UEDNESOAY  Cxxxxx  a]  .  HAY  <2S>  .  Cxxxxx  e]  JOB  SEEKERS  HAY  BE  AS  SPECIFIC 
AS  THEY  LIKE  IN  HENTIONING  <THE>  <H0URS>  OR  DAYS  <THEY>  ARE  Ixxxxx  a]  AVAILABLE 
FOR  EnPLOYHENT  .  THE  TYPE  OF  UORK  <BE>  IY1  DESIRE  OR  CAN  00  OR  <UHAT>  UAGES 
THEY  EXPECT  .  THE  AOS  UILL  APPEAR  IN  <THE>  HAY  20  Ixxxxx  a]  <CLASSIFIED> 

SECTION  UNDER  <HE>  Cxxxxx  a]  HEAOING  <HIRE>  A  BOSTON  TEENAGER  FOR  THE  SUMMER  . 

Figure  6a.  COMOEC  Translation  of  Plain  Text 
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translator.  An  "a"  in  s guars  brackets  indicates  an  error- 
sign;  "XXXXX"  preceding  the  "a"  indicates  that  a  portion 
of  the  message,  believed  to  be  the  pert  in  error,  wee 
suppressed;  and  "<  >*  indkrates  a  word  obtained  from 
the  dictionary  by  assuming  the  sender  made  a  mark- 
error.  The  COMOEC  translator  made  four  errors: 
transcribing  "BY'  as  "THEY" ,  "THEY  as  "<8E>’I  "THE"  as 
"HE";  and  not  suppressing  "BLVO"  after  the  word 
"WASHINGTON".  To  handle  the  last  error  correctly  the 
translator  would  need  to  know  that  in  Boston,  Washington 
is  a  street  and  not  a  boulevard.  Less  than  three  percent 
of  the  words  in  the  message  were  in  error.  The 
operator  made  33  errors  in  translating  this  message:  a 
20Z  error  rate.  Both  COMOEC’s  result  and  the 
operator’s  result  are  (gate  readable. 

Figure  5b  shows  a  translation  by  a  MAUOE-iike 
translator  of  some  chatter,  a  header  and  a  portion  of  a 
ciphered  message.  Figure  6b  shows  the  identical  portion 
of  Morse  code  as  translated  by  the  cooperative  efforts 
of  COMOEC  and  CATNIP.  Both  the  operator  and  the 
COMOEC -CATNIP  translations  of  the  chatter  and  header 
were  perfect  (The  ’{  }"  at  the  beginning  indicates  that 
the  enclosed  sequence  cannot  be  translated. 


Consequently  the  translation  obtained  from  the  moving 
threshold  first  stage  translator  was  used  (This  construct 
resulted  from  a  splice  in  the  audio  tape.) 

The  COMOEC -CATNIP  tandem  translated  1 9  groups 
incorrectly  and  the  operator  32.  It  is  interesting  that  both 
the  operator’s  translation  and  and  the  machine’s 
translation  had  13  incorrectly  translated  groups  in 
common.  In  addition  to  translating  groups  incorrectly  both 
the  operator  and  the  machine  added  9  extra  groups 
because  the  semantics  of  the  error-signs  were 
ambiguous. 

One  very  encouraging  aspect  of  the  system  is  that 
on  a  DEC  20  it  is  2  to  10  times  faster  than  real-time, 
depending  upon  the  number  of  irregidarities  and  errors  in 
the  Morse  code. 

One  should  not  draw  hasty  conclusions  from  these 
results.  We  have  worked  very  hard- to  obtain  the  results 
we  have  but  the  job  is  not  yet  complete.  For  instance, 
the  chatter  and  header  constructs  preceding  the  plain 
text  message  of  Figure  6a  are  not  yet  understood  by 
CATNIP.  Furthermore,  a  different  set  of  operators  using 


"MAUDE"  decoding: 

SALT:  (,,-VVVVl  (VVViOSA?  K 

ROCK:  RRRRQSA5QSA5  K 

SALT:  RAE  METSA  S  EAEN  METS  AS  (JEN  METENK  5  GTTC  MATC  TN  A?  K 

ROCK:  RRRQRVK 

SALT:  RRR  (-..—I  NUHR  TFC  HR  TFC  ■  »  »  NR  1  GR  208  044e  -  VOLVR  QZQNT 

PQTOR  TCY (..-....I  VLHAP  XGJIN  NVRLC  TJQMfl  (V. AG  DTZTZ 
POOIH  QRCNAGZOC  E  LYOE  HSS  GQOTR#  H  L1-. - JOE  B  TXTB  91m  32 

Figure  5b.  MAUDE  Translation  of  Chatter,  Header,  and  Parts  of  a  Ciphered  MessegBL 

COMOEC  decoding: 

SALT:  H..-VVVVI1  VVV  dSA  ?  K 

ROCK:  RR  RR  OSA  S  QSA  5  K 

SALT:  RR  UR  QSA  5  UR  OSA  S  UR  ORK  S  QTC  QTC  GA  ?  K 

ROCK:  R  RR  ORV  K 

SALT:  RR  R  NU  NU  HR  TFC  HR  TFC  [BREAK] 

[BREAK] 

[BREAK] 

NR  1  GR  200  <0445>  [BREAK] 

VOLVR  OZQNT  PQTOR  TCYUH  VLHAP 

XGJIN  NVRLC  TJOMM  V14LAG  OTZTZ  _ 

POOIH  QRCNA  GZQCE  Cxxxxx  a]  GQOTR  [xxxxx  el  LYOOE  BTXTB  0153c 

Figure  6b.  COMOEC  Translation  of  Chatter,  Header  and  Parts  of  a  Ciphered  Message. 
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a  different  set  of  chatter  conventions  might  require  a 
significant  amount  of  effort  to  augment  the  domain  models 
to  understand  and  translate  the  new  chatter.  Yet  a 
human  operator  could  adapt  to  the  new  operator's  style 
and  acquire  knowledge  about  the  new  chatter 
conventions  by  listening  to  the  conversations  far  a  few 
hours  or  at  most  a  few  days. 
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THE  POSITIONING  PROBLEM  - 
A  DRAFT  OP  AN  INTERMEDIATE  SUf®URI 


Xechiam  femini 

USC/Infa rmation  Sciances  Institute 


AN  INFORMAL  INTRODUCTION 

The  positioning  proolem  arises  when  it  is  necessary 
to  locate  a  set  of  geographically-distributed 
objects  using  measurements  of  tbe  distances  between 
some  object  pairs.  In  a  Packet  Radio  Network,  for 
Instance,  any  two  network  members  that  can  talk  to 
eacn  other  may  use  a  simple  time-stamping  aecnanism 
to  measure  the  distance  between  them;  a  distance 
measurement  protocol  may  then  be  developed.  The 
problem  is  another  and  how  the  distance 
measurements  can  be  used  to  determine  the 
geograonical  Location  with  respect  to  a  given 
system  of  coordinates. 

A  knowledge  of  the  precise  location  of  eacn  network 
r.oae  is  crucial  to  tbe  operation  of  Distributed 
censors  Networks.  The  data  collected  and 
interpreted  oy  different  sensors  may  be  correlated 
and  integrated  only  If  we  know  their  precise 
Lcoation.  A  position-locating  system  may  be 
invaluaole  to  the  operation  of  a  fleet  of  vehicles, 
each  equipped  with  a  Packet  Radio  Unit.  For 
example,  monitoring  the  location  of  a  fleet  of 
security  vehicles,  aircraft,  a  tank  division,  or  a 
flodk  of  missiles  could  all  be  assisted  by  a 
position-locating  system.  Clearly  a  positioning 
system  would  be  an  important  service  to  Packet 
Radio  Network  users.  ^  .  _ 

A  few  problems  must  be  solved  before  a  good 
positioning  system  may  be  developed: 

> .  if f icient  algorithms  to  determine  the 
location  of  oojects  by  using  distance 
measurements  should  be  developed. 

2.  Conditions  under  which  a  solution  exists  or 
joes  not  exist  snould  be  identified. 

1.  Conditions  under  which  there  exists  a 

'unique  solution  should  be  established. 

u.  Conditions  under  which  there  exists  a 

finite  number  of  solutions  snould  be 
identified.  It  snould  also  oe  understood 
now  to  transform  one  solution  into  another. 

5.  Conditions  under  whicn  the  solution  is 

insensitive  to  small  measurement  errors 
snould  be  estaolisned. 

5.  Tlgnt  bounds  upon  the  accuracy  of  the 
solution  snould  be  determined. 


However,  wnile  the  formulation  of  the  problems  is 
simple,  tne  mathematical  and  algorithmic 
intricacies  of  deriving  solutions  are  perplexing. 

To  develop  some  insignt  into  the  problems,  let  us 
consider  a  few  simple  examples.  The  simplest 
positioning  problem  of  interest  is  to  locate  three 
points  using  distance  measurements.  3y  means  of 
simple  trigonometry,  this  may  usually  be  done 
easily.  However,  let  us  consider  a  degenerate 
triangle  (Figure  1).  Because  the  system  is  very 
sensitive  to  errors,  a  small  error  in  the 
measurement  may  produce  a  large  error  in  the 
computed  position.  Some  of  the  questions  to  be 
addressed  are  as  follows:  Why  is  the  degenerate 
triangle  sensitive  to  errors?  How  can  we  determine 
whether  or  not  other  systems  are  sensitive? 


Figure  1.  A  Degenerate  Triangular  System 

Positioning  systems  may  be  constructed  by  a  simple 
procedure  of  pasting  triangles  together,  and  such 
systems  may  be  positioned  by  solving  the  triangles 
from  wftlch  they  are  constructed.  For  instance, 
consider  tbe  system  of  points  depicted  in  Figure  Z. 
It  is  possible  to  locate  the  points  in  tbe  order 
numbered.  However,  the  same  system  admits  a  few 
solutions  (the  number  of  wnlch  grows  exponentially 
with  the  number  of  nodes) .  If  we  had  some  further 
information  about  the  positions  of  the  objects,  how 
could  we  use  it  to  identify  the  true  solution?  For 
instance,  if  one  node  is  known  to  oe  a  venicle 
moving  on  a  certain  road,  many  of  the  feasible 
solutions  that  satisfy  the  distance  constraints  can 
be  eliminated,  because  they  assign  the  vehicle  to  a 
position  not  on  the  road.  How  should  that 
elimination  be  effected? 


Ill-conditioned 

identified. 


problems 


Figure  Z.  A  Triangulated  Positioning  iystem 


tnat 


In  particular,  if  a  new  measurement  ware  given 
.Figure  3).  now  can  we  find  tne  rignt  solution 
among  all  possible  ones?  ,1  few  solutions  nay  match 
me  measurements  to  witnin  a  given  error.)  We  nave 
sncwn  tnat  this  last  decision  proolem  belongs  to 
t  Re  class  of  difficult  combinatorial  decision 
problems— -tna  so-called  HP  complete  problems.  The 
last  statement  also  proves  tnat  the  existence 
proolem  (i.e.,  Does  a  solution  exist?)  is  SP 
complete. 


contains  a  suosystem 
independently.  Namely, 


nay  oe  solved 
any  incremental 


construction  algorithm  must  cecide  whether  or  not  a 


given  positioning  system  is  primitive  l.e.. 


oonstructible  but  having  no  constructiole 
subsystems).  The  last  problem  seems  to  oe  a 


difficult  combinatorial  proolem  wmcn  we  suspect  to 
be  MP  complete  (thougn  we  so  not  yet  anew  now  to 
prove  this) . 


If  the  last  conjecture  is  true,  then  tne  proolem  of 
constructing  solutions  to  tne  positioning  problem, 
using  an  incremental  algorithm,  is  NP  complete. 
This  unfortunate  result  does  not  imply  that  "bruts 
force*  (i.e.,  iterative  algorithms)  snould  be 
preferred.  It  is  reasonable  to  believe  tnat  most 
actual  positioning  problems  may  be  better  solved  oy 
means  of  an  intelligent  Incremental  algorithm.  The 
precise  meaning  of  "most"  is  yet  to  be  defined. 


Numerous  other  challenging  and  interesting  related 
problems  exist.  While  we  will  not  maxe  a 
comprenensive  presentation,  we  will  examine  some  cf 
the  problems  formally,  expose  tne  difficulties,  and 
present  some  partial  solutions  we  nave  developed. 
This  report  Is  an  extended  summary  cf  our  present 
state  of  knowledge.  A  sore  detailed  report  is  now 
being  prepared. 


1.  THE  PROBLEMS 


Mot  all  positioning  systems  may  be  solved  with  the 
aid  of  triangles.  In  fact,  for  a  system  possessing 
a  3ufficlsntiy  large  number  of  nodes,  It  is  always 
possible  tnat  tne  position  of  the  nodes  can  be 
'.coated  only  by  solving  for  the  location  of  all 
points  simultaneously.  Such  unfortunate  systems 
require  an  enormous  amount  of  computation.  For  a 
simple  example,  consider  the  hexagon  cf  Figure  4. 
It  Is  impossible  to  solve  the  location  of  Its  nodes 
using  an  incremental  algorithm;  all  must  be  solved 
at  once. 


Figure  u.  A  Positioning  System  which  may  only 
pe  solved  simultaneously 


Fortunately  enough,  such  primitive  syatems  (wnose 
parts  cannot  be  positioned  unless  the  whole  system 
is)  seem  to  be  rare.  Many  positioning  systems 
pould  pe  solved  using  an  incremental  process  (wnich 
simplifies  the  solution  algorithm  and  increases  its 


The  positioning  problem  can  oe  descrloed  is 

follows: 

1.  P  4  iP)t  ?2,  .  .  .  a  set  of  points  In  - 

the  plane. 

Z.  A  set  of  distance  measurements  oetween  some 
pairs  of  points.  Eacn  measurement  datum 
consxats  of  tne  identity  of  tne  pair  ?,  and 

tne  measured  distance  d,  and  an 
J  i  J 

estimate  of  the  measurement  error  4 . 

3.  Position  coordinates  for  at  least  three 
points,  say  P^,  ?2,  P,,  to  be  oaliea  tne 

base  triangle. 


We  shall  call  the  set  of  points  ?,  together  with 
the  distance  measurements  id,,:  and  tne  base 
triangle,  a  point  system. 

1.2  PROBLEMS 

A  feasible  aoslticn  of  the  point  system  is  a  3et  cf 
coordinates  tnat  satisfies  tne  distance  constraints 
and  assigns  to  the  case  triangle  its  actual 
coordinates.  The  set  of  ail  feasible  positions 
will  be  palled  the  solution  set  of  tne  positioning 
problem.  The  positioning  problem  oonslsts  of 
characterizing  one  solution  set  namely, 


speed  and  accuracy).  However,  an  algorithm  t.nat 
■would  construct  the  location  of  a  given  point 
3ysten  py  constructing  3uosystems  first  snould  be 
able  to  identify  oonstructible  parts.  In 
particular,  suer,  an  algorithm  snould  oe  aoie  to 
decide  whether  or  net  a  given  positioning  system 


1.  Is  tne  solution  set  emDty?  ' existence) 

2.  Tees  tne  sciution  set  contain  a  continue 
of  solutions,  or  is  it  a  discrete  set)  I 
it  a  finite  set?  zer.eralizec  mi ouer.es s 
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3.  If  the  solution  sot  Is  finite,  wnat  are  all 
tne  feasible  solutions?  ( position 
construction) 

4.  What  Is  the  error  in  the  position  due  to 
propagation  of  measurement  errors?  (error 
analysis) 

5.  What  is  the  sensitivity  of  the  solution  set 
to  aeasureaent  errors?  ( sensitivity 
analysis) 

In  what  follows  we  axaoina  some  partial  anawars  to 
some  of  the  difficult  problems  posed  above. 

Note  that  in  the  sequel  we  restrict  ourselves  to 
the  problem  of  positioning  points  relative  to  eacn 
other,  i.e.,  with  respect  to  any  coordinate  system 
of  our  onoiee.  This  leaves  us  three  degrees  of 
freedom  (two  for  translation  and  one  for  rotation) 
and  tne  orientation  for  our  choice  of  the 
coordinate  system.  With  the  aid  of  the  third  item 
on  the  input  list  (section  1.1)  it  is  possible  to 
position  the  point  system  absolutely  once  it  has 
oeen  positioned  relative  to  an  arbitrary  coordinate 
system. 

2.  GEOMETRIC  RESULTS 

We  associate  wltn  the  point  system  a  graph  whose 
vertices  represent  points  and  wnose  edges  represent 
distance  measurements.  We  call  this  graph  the 
measurements  graph  and  say  that  a  given  property  of 
a  point  system  is  combinatorial  when  It  can  be 
expressed  in  terms  of  the  measurements  graph  only. 
A  structure  is  a  graph  together  with  a  mapping  of 
edges  into  positive  real  numbers  which  we  call 
Lengths.  A  positioning  system  may  be  considered  as 
a  pin- Jointed  bar  structure,  i.e.,  a  trusa. 
Problems  of  uniqueness  bf  (i.e.,  structure  of  the 
solution  set)  for  the  positioning  problem  translate 
into  problems  of  rigidity  of  the  respective  truss. 
Problems  bf  a  solution's  sensitivity  to  errors 
translate  into  problems  of  infinitesimal  rigidity 
of  tne  respective  truss  (i.e.,  admissibility  of 
infinitesimal  flexing  of  tne  truss).  Problems  of 
constructing  a  solution  to  the  positioning  problem 
correspond  to  construction  of  the  truss.  Therefore 
we  snail  use  methods  and  terminology  tnat  pertain 
to  both  structures  and  trusses. 

2.1  3131317* 

The  results  in  this  area  fall  into  three  olaases: 

1.  A  number  of  different  characterizations  of 
infinitesimal  rigidity  (error  sensitivity). 

2.  An  algorithm  to  determine  whether  or  not  a 
given  solution  of  the  positioning  problem 
is  sensitive  to  measurement  errors. 

3.  A  combinatorial  characterization  of  plane 
rigidity  in  terms  of  a  property  of  the 
underlying  measurements  graph. 

While  the  prooiea  of  structural  rigidity  has 
attracted  mathematicians,  engineers,  and  aroniteots 


for  several  centuries^,  tne  solution  to  most 
fundamental  questions  of  rigidity  are  far  from 
<nown.  Recently  Interest  in  this  age-old  problem 
has  been  renewed  C'WHITELI  77,73],  and  some 
significant  contributions  produced. 

2.1.1  Characterization  of  rigidity 

The  first  problem,  i.e.,  that  of  characterizing 
rigidity,  has  a  few  solutions,  all  of  which  employ 
local  infinitesimal  characterizations.  Loosely 
speaking,  a  structure  is  rigid  if  it  does  not  admit 
relative  motions  of  its  parts,  that  is,  if  the  only 
motions  which  it  admits  are  trivial  (i.e.,. 
translations  and  rotations).  Therefore  the  study 
of  rigidity  is  a  study  of  possible  motions.  A 
rigid  structure  corresponds  to  a  positioning 
problem  wltn  a  discrete  solution  set.  Aa 
infinitesimally  rigid  structure  corresponds  to  an 
error-insensitive  solution. 

There  are  two  approaches  to  motions  of  structures: 
It  is  possible  to  consider  the  velocity  vectors  of 
the  nodes  or  the  relative  angular  motion  of  edges 
attached  to  a  common  node.  Accordingly,  it  is 
possible  to  develop  two  notions  of  infinitesimal 
rigidity.  Another  possible  approach  is  to  consider 
the  stresses  in  the  structure  resulting  from 
applying  external  forces.  Rigidity  may  be  defined 
as  tne  ability  of  the  structure  to  resolve  forces. 
It  is  possible  to  show  that  both  tne  aoove 
approaches  are  equivalent  [CLOCK  75,  WHITELX  77, 
731. 

oasa  anal  «m;. 

1 .  Characterization  of  rigid  structures  which 
are  not  Infinitesimally  rigid. 

An  infinitesimally  rigid  structure  is 
rigid,  out  not  vice  versa.  It  is  possible 
for  a  structure  to  be  rigid  (i.e.,  admit  no 
finite  relative  motions  of  Its  parts)  yet 
to  admit  Infinitesimal  perturbations  (i.e., 
be  sensitive  to  errors).  A  typical  example 
of  an  error-rigid  rigid  structure  is  tne 
degenerate  triangle  in  Figure  1 . 

The  difficulty  in  solving  this  problem  is 
that  we  do  not  possess  extensive  tools  for 
global  analysis,  while  local  analysis  is 
wall  developed. 

2.  Characterization  of  rigidity  with  respect 
to  discontinuous  motions  sucn  as 
reflections. 

Consider  the  pair  of  pasted  triangles  in 
Figure  5(a)  below.  The  two  triangles  may 
be  positioned  with  respect  to  eacn  other  m 
two  distinct  ways.  The  two  resulting 
structures  are  rigic  (do  not  adr.it  r.on 
trivial  motions;  but  admit  relative 
reflection  of  parts.  Ttfter  structures 


^  A  partial  list  of  researchers  interestea  ir.  tr.e 
problem  includes  Rascal,  Euler,  Cauony,  Maxwell, 
layley,  Alexandrov,  and  others. 
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idalt  i  more  complex  form  cf  discrete 
movement  of  their  internal  parts.,  e.g., 
figure  5(b) . 

The  problem  of  onaraoterizing  rigidity  with 
respect  to  discrete  notions  is  difficult, 
for  we  not  only  have  to  address  a  prodlea 
of  global  analysis,  but  also  face  difficult 
problems  of  combinatorial  topology. 


3  3 


Figure  5(a) 


*  3  4  * 


1  l 

Figure  5(b) 


Figures  5(a)  and  (b).  Two  cases  of  two  rigid  struc¬ 
tures  solving  the  same  positioning  problem. 


2.1.2  Error  sensitivity 

The  different  definitions  of  infinitesimal  rigidity 
induce  different  algorithms  to  determine  whether  or 
not  a  given  structure  is  infinitesimally  rigid. 
However,  most  of  those  algorithms  have  been 
produced  by  and  for  mathematicians  unconcerned  with 
computational  efficiency.  Ve  have  developed  a 
novel  algorithm  to  test  whether  a  given  feasible 
solution  of  the  positioning  problem  is 
infinitesimally  rigid,  l.a.,  insensitive  to  errors. 

The  idea  behind  the  rigidity  testing  algorithm  is 
simple:  try  to  solve  for  an  admissible  assignment 
of  infinitesimal  velocities  that  names  the 
structure.  It  is  necessary  to  examine  only  the 
effects  of  a  velocity  assignments  over  a  set  of 
basic  circuits  of  tha  underlying  measurement  graph 
in  order  to  reduce  the  problem  to  the  solution  of  a 
linear  system  of  equations. 

■3aan  aCfl&Uflai 

1.  Establish  measures  of  error  sensitivity  and 
algorithms  to  compute  sensitivity. 


The  characterisation  of  error  sensitivity 
in  terms  of  infinitesimal  rigidity  is  too 
cruda.  We  would  like  to  have  an  estimate 
of  how  much  error  sensitivity  a  given 
structure  possesses. 

2.  Develop  sensitivity  measures  in  taros  of 
the  distance  measurements  lata,  not  the 
particular  solution  thay  yield. 

In  addition  to  tha  difficulty  Involved  in 
developing  aprlorl  sensitivity  measures,  we 
face  tha  difficulty  that  the  same 
measurement  data  may  have  a  number  of 
solutions,  each  possessing  different 
sensitivity.  Aprlorl  it  Is  even  possible 
for  a  given  system  of  measurement  data  to 
possess  some  rigid  and  some  nonrlgld 
solutions.  Figure  6  depicts  two  solutions 
of  tns  same  positioning  problem,  one  rigid 
and  the  other  infinitesimally  flexible. 


I 


Figure  6.  A  positioning  problem  possessing  a  rigid 
and  an  infiniteslmally-flexibie  solution 


We  snail  now  dsscrlbs  the  third  class  of  results  in 
more  detail. 

2.2  FROM  GEOMETRIC  TO  COMBINATORICS 
2.2.1  Stiff  graphs 

Tha  most  powerful  results  of  the  study  of  rigidity 
appear  to  bo  combinatorial  characterizations  cf 
rigid  structures.  The  ideas  behind  the  passage 
from  geometry  to  combinatorics  are  founded  on  some 
simple  intuitive  sxperienoesr  the  major  Idea  of 
which  Is  that  soma  combinations  cf  bars  (edges)  and 
hinges  (nodes)  are  rigid  for  almost  any  choice  cf 
plane  imbedding.  For  instance,  the  full  graph  rn 
three  nodes  is  rigid  for  all  plane  lmbeddings;  it 
Is  not  Infinitesimally  rigid  when  the  three  points 
are  colinsar.  Similarly,  structures  whose 


intuition  should  not  be  pursued  blindly,  however, 
when  It  comes  to  problems  cf  rigidity,  for— as  we 
snail  see— many  intuitive  expectations  turn  cut, 
surprisingly,  to  be  false. 


140 


underlying  graph  la  a  full  graph  are  rigid  for 
almost  any  plane  imbedding.  On  the  other  hand, 
some  graphs  will  produce  flexible  structures  for 
almost  any  plane  imbedding  (e.g.,  a  circuit  on  four 
nodes  is  almost  always  flexible,  except  when  one  of 
the  edges  is  assigned  a  zero  length  or  when  all 
four  points  are  colinear).  The  problem  is:  1a.1L 
possible  to  characterize  graphs  almost  all  of  whose 
embeddings  fora  rigid  structures?.  This  question 
has  been  the  major  problem  of  the  theory  of 
rigidity. 

We  shall  define  a  critical  combinatorial  property 
of  graphs  which  we  call  (plana)  stiffness.  First 
we  associate  with  a  graph  1  —  <V,E>  a  number 

f(3)  —  2(V|-|SI-3,  measuring  the  overall  excess 

of  unknowns  over  constraints.  (Each  vertex 
contributes  two  unknown  coordinates,  and  each  edge 
constrains  the  two  vertices  incident  unto  it 
through  a  single  equation  for  the  distance.  Mote 
tnat  we  discount  three  degrees  of  freedom  to 
account  for  possible  external  motion,  i.e., 
translation  and  rotation.)  The  quantity  f(G) 
measures  the  overall  freedom  of  interns!  movement 
of  the  graph. 

The  quantity  f(G)  may  be  used  to  express  the 
property  of  stiffness,  i.e.,  of  having  a  sufficient 
number  of  constraints  to  prevent  relative  motions 
of  different  parts  of  a  graph.  Loosely  speaking,  a 
graph  la  stiff  If  it  is  possible  to  remove  some 
redundant  edges  so  that  the  remaining  graph  has  0 
degrees  of  internal  freedom,  and  non  of  its 
subgraphs  has  an  excess  of  constraints  (i.e.,  a 
negative  internal  freedom).  Formally,  a  graph 
I*<V,2>  is  stiff  if  it  has  a  spanning  subgraph 
j's<V,E'>  (i.e.,  O'  is  generated  by  removing 
excessive  constraints  from  3)  such  that 

1.  f (G * ) *0  (i.e.,  G'  has  0  degrees  of  Internal 
freedom) 

2.  if  G"  is  any  subgraph  of  G'  then 

f(G")  >  0.  (i.e.,  3'  does  not  possess 

Internally  over-constrained  subgraphs). 


2.2.2  The  rigidity  theorem 

The  most  important  result  of  the  geometric  theory 
of  positioning  is 

THEOREM  (Plane  Rigidity)  3: 

A  nlkld  structure  aul  have  a  stiff 
measurements  graph.  SflamaLLU  ilacat 
ALL  —ana  Inned dines  a  stiff  graph  aCS 
rigid*. 

(Here  "almost  ail"  is  used  in  the  topological 
sense,  i.e.,  the  set  of  rigid  plane  imbeddings  of  a 
stiff  graph  is  open  and  dense  in  the  space  of 
imbeddings,  which  further  implies  that  for  any 
Borei  probability  measure  on  the  space  of  all 
lmoeddings,  continuous  with  respect  to  Lebsegue 
measure,  the  set  of  non  rigid  imbeddings  of  a  stiff 
grapn  is  of  measure  0.) 

3CLAMA:I  71,  3L0CK  75,  WHIT  ELI  79,  SOTH-ASSIMOW  ’3] 


The  theorem  above  is  a  significant  tool  for 
handling  positioning  problems,  namely,  it  makes  it 
possible  to  infer  properties  of  structure  and 
derive  answers  to  the  problems  of  positioning  by 
examining  measurement  graph  only .  The  results 
that  we  derive  will  be  true  for  almost  all 
assignments  of  distances  to  edges  of  the 
measurement  graph,  which  greatly  simplifies  the 
study  of  the  positioning  problem. 

To  further  appreciate  the  power  of  the  result  let 
us  note  in  passing  that  the  theorem  does  not 
generalize  even  to  three  dimensional  structures. 
That  is,  it  is  possible  to  have  a  sufficient  number 
of  well  distributed  constraints  and  yet  have  a 
structure  with  a  continuum  of  solutions  no  matter 
what  lengths  are  assigned  to  the  edges.  Figure  7 
below  depicts  a  typical  system.  The  combinatorial 
characterization  of  rigid  structures  in  spaces  of 
dimensions  greater  than  two  is  an  open  problem. 


2 


Figure  7.  A  counter  example  to  a  3-dlmenslonal 
rigidity  tneorem 


2.3  LIMITATIONS  OF  THE  RIGIDITY  THEOREM 

The  rigidity  theorem  guarantees  that  a  structure 
based  on  a  stiff  graph  will  almost  always  be  rigib. 
However,  it  is  possible  to  assign  lengths  tc  sdgss 
of  a  stiff  grapn  suen  that  the  resulting  structure 
admits  infinitesimal  flexing  and  is,  therefore, 
very  sensitive  to  errors.  In  fact,  the  structure 
beoomes  an  error-increasing  mechanism.  One  suen 
structure  is  the  degenerate  triangle  of  Figure  i , 
another  is  Pascal's  hexagon  depicted  in  Figure  3. 
This  hexagon  is  stiff  and  thus  rigid  for  almost  all 
plana  imbeddings.  However,  the  hexagon  is 
Infinitesimally  flexible  whenever  and  only  If)  its 
nodes  lie  bn  a  oonic  section.  This  cizarre  result, 
due  to  [CROFTCN  1873],  has  been  rediscovered 
Independently  by  many  researchers  (Including 
oursei  vas).  The  proof  follows  from  a  simp.e 
application  of  a  celebrated  theorem  of  Pascal. 

An  even  worse  oase  is  that  of  a  stiff  grapn 
admitting  a  oontinuous  motion  i.e..  an  imbedding 
of  the  stiff  grapn  thet  is  net  even  globally 
rigid).  Such  a  structure  is  leoicted  in  Figure  i. 


Ill 


5  d 


1  2. 

Figure  3.  Pascal's  Hexagon 


figure  9.  A  flexible  structure  whose 
underlying  graph  is  rigid. 


Although  the  underlying  grapn  is  stiff  (even 
overconstrained),  the  solution  set  of  the 
respective  positioning  problaa  contains  a  contlnuua 
of  solutions. 

To  produce  such  an  example  we  started  with  a 
nonrigid  structure  (l.a.,  a  mechanism) ,  then 
meticulously  added  Pars  that  did  not  constrain  the 
motion  of  the  original  mechanism.  Specifically,  we 
started  with  a  four-oar  mechanism,  selected  a  point 
on  any  diagonal  and  connected  this  point  to  points 
on  the  original  mechanism.  By  careful  selection  of 
the  connections  to  produce  two  parallelograms .  the 
motion  of  the  original  mechanism  is  not  perturbed 
oy  the  additional  "constraints."  Additional  Bars 
are  used  only  to  hold  each  of  the  original  four 
oars  together. 

The  rigidity  theorem  guarantees  that  such 
unpleasant  positioning  systems  are  extremely  rare, 
let  we  should  bear  in  mind  that  they  may  exist 
(highly  symmetric  structures  are  very  likely 
exceptions) . 

3.  COMBINATORICS  OF  POSITIONING 

Having  seen  the  significance  and  limitations  of 
stiffness,  we  will  now  study  stiff  graphs  in  order 
to  develop  methods  for  recognizing  stiffness  and 
for  tearing  structures  into  stiff  subparts,  later 
cementing  those  parts  together.  (Tearing  is  the 


natural  Instrument  for  a  dlvide-and-conquer 
approach  to  the  construction  problem.) 

3.1  CHARACTERIZATION  OF  STIFF  GRAPHS 

The  question  that  we  would  like  to  address  in  this 
section  is:  What  a  graph  stiff?  He  wish  to 

derive  necessary  and  sufficient  conditions  for 
stiffness  in  terms  of  well  known  graph  properties. 
The  most  natural  of  classical  graph  properties 
relating  to  stiffness  is  connectivity. 

3.1.1  Some  simple  characterizations 

We  have  Been  able  to  derive  a  list  of  useful 
properties  of  stiff  graphs: 

1.  A  stiff  grapn  is  a  Block  (i.s.,  has  no  cut 
vertex) . 

Z.  A  cut-set  of  a  stiff  grapn,  separating  it 
Into  two  nontrivial  components,  must 
contain  at  least  3  edges . 

3.  A  3-cut-set  of  a  stiff  grapn,  separating  it 
into  two  nontrivial  components,  cust  also 
separate  it  into  two  stiff  grapns. 

A.  If  3*<V,E>  is  a  stiff  grapn  and  V  -  7  a 
vertex  cut-set  separating  3  into  ccmpcnents 
j.  »  <Vl,£i>  and  if  tne  suograon  cf  3 
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1.  Thera  »«i at  graona  whose  nodes  noaaess 
ardltrar-t  1  v  1  arae  degrees  out  wnion  are  not  stiff . 


spanned  by  7'  is  stiff,  then  so  are  the 
suDgrapns  spanned  by  7*  u  71. 

5.  If  v  is  a  vertex  of  degree  two  in  a  graph 
3,  then  j  is  stiff  iff  the  subgraph  of  2 
foraed  by  removing  v  is  stiff  too. 

The  aOove  results  are  a  sample  from  a  larger  class 
df  results,  all  of  wnion  serve  to  establish  tools 
for  a  "divide  and  conquer"  approach  to  the 
stiffness  problem.  For  instance,  we  would  lilte  to 
determine  whether  a  given  graph  can  be  torn  into 
stiff  subparts  which  aay  later  be  used  to 
synthesise  the  original  graph.  Figure  10  depicts  a 
typical  configuration,  corresponding  to  result  (3) 
in  the  above  list.  Figure  11  depicts  another 
possible  configuration  whicn  admits  tearing,  this 
time  a  particular  instance  of  (4). 

3.1.2  Stiffness  and  connectivity 

loosely  s pe axing,  stiffness  is  a  property  of  grapns 
wnion  has  to  do  with  the  density  of  edgea,  i.a..  It 
13  a  measure  of  how  well  different  nodes  are 
attached  to  each  other.  It  is  only  natural  to 
expect  that  such  a  property  should  bear  relation  to 
classical  measures  of  edge-density. 

There  are  three  major  classical  measures  of 
edge-density:  node  degrees,  minimal  edga  cut-sat, 
minimal  vertex  cut-set.  Here  we  explore  the 
relations  among  these  three  properties  and 
stiffness. 


To  prove  this  result  we  describe  a  simple  method  to 
construct  counter  examples.  Start  witn  a  flexible 
graph,  say  a  four-bar  mechanism.  Add  nodes  and 
edgea  ts  increase  the  degrees,  preserving  the 
flexibility.  This  process  is  demonstrated  in 
Figure  12. 

The  above  process  serves  to  show  that: 

2.  There  exist  graphs  with  aa  aC&HOCllX  UCgfc 
ml  hi  we  1  edge-cut-set  but  Which  JEft  3S1  Stiff. 

Thus,  two  of  the  classical  measures  of  connectivity 
are  not  related  to  stiffneaa.  Let  us  now  consider 
the  strongest  measure  of  connectivity, 
vertex-connect i vl ty . 

Ue  have  seen  in  the  previous  section  that  a  stiff 
graph  is  at  least  2 -connect ed.  It  is  easy  to 
construct  2  and  3 -connected  graphs  which  are  not 
stiff.  But  with  it-conneeted  grapns,  X  >  “,  the 
problem  is  no  longer  simple.  Indeed  any 
4-connected  graph  must  have  a  minimal  degree  which 
is  at  least  4.  Thus  the  total  number  of  edges  a  is 
at  least  twice  the  number  of  nodes.  Therefore  the 
overall  number  of  internal  degrees  of  freedom 
f(G)s2n-m-3  is  not  greater  than  -3.  Mot  only  is  a 
4-conneeted  graph  over-constrained,  but  the 


Figure  11.  Tearing  along  a  stiff  vertex  cut-set 


connectivity  implies  tttat  the  edges  must  se  well 
distributed.  Intuitively  one  would  expect  tnat  a 
^—connected  grapn  is  stiff. 

Met  so.  It  is  possible  to  construct  4-connected 
ana  even  5-connect  ad  graphs  which  are  flexible. 
Two  auen  examples  are  depicted  in  Figures  13  and 
14. 

The  process  we  applied  to  derive  these  two 
surprising  grapns  cannot  be  applied  to  produce  S- 
Cor  more)  connected  jrapns  which  are  flexible.  He 
lo  not  know  at  all  whether  such  graphs  even  exist . 
The  problem  is  open: 

3 .  li  there  4  number  jg  such  that  any  k-connected 
kraon  is  stiff? 

At  this  point,  nowever,  the  problem  is  mainly  of  an 
academic  interest,  for  a  condition  which  requires 
suen  a  hign  connectivity  seems  to  be  of  no 
practical  significance. 

To  summarize,  we  have  seen  that  the  relation 
oetween  stiffness  and  measures  of  connectivity  (if 
there  is  any)  are  not  simple,  contrary  to  the 
apncrl  intuition  which  leads  us  to  explore  these 
relations . 


Figure  12.  Constructing  flexible  jrapns  with 
arbitrary  node  degrees 


3.2  CONSTRUCTION  PROBLEMS 

A  position-locating  algorithm  is  essentially  1 
Process  that  starts  with  some  set  of  points  wnose 
relative  positions  are  known  and  gradually  attacnes 
new  sets  of  points  wnose  relative  positions  are 
computed  with  respect  to  tne  original  nucleus. 
Such  a  process  may  be  viewed  as  a  (possibly 
parallel)  solidification  of  parts  of  the 
measurement  grapn  into  bodies. 

To  be  able  to  describe  incremental  construction 
processes  we  need  to  introduce  a  suitable 
formalism.  In  the  following  we  shall  describe  such 
a  formalism,  then  apply  it  to  develop  and  Implement 
construction  algorithms. 

3.2.1  Stiff  hypergraphs 

A  Hvpergrann  H  =  <V,2>  consists  of  a  set  of 
vertices  V  and  a  set  of  edges  S.  An  edge  Is  a 
subset  of  vertices  (not  necessarily  Just  two,  as  in 
graphs).  We  shall  use  this  generalized  notion  of 
an  edge  to  describe  a  set  of  vertices  whose 
relative  positions  are  known.  An  edge  is  said  to 
be  incident  upon  a  given  vertex  if  it  contains  the 
vertex.  A  vertex  is  said  to  be  incident  upon  a 


Figure  '4.  A  5-conneoted  flexible  grapn 


given  edge  If  It  la  contained  in  tais  edge.  We 
snail  consider  only  hypergraphs  for  -mica  tae 
intersection  of  any  two  edges  contains  at  most  one 
point . 


of  nypergrapns  in  tae  previous  stages.  In  3nort , 
an  incremental  construction  algorithm  is  a  process 
that  traces  a  cnain  in  the  partial  order  of 
welding. 


To  develop  a  visual  intuition  of  tae  processes  to 
oe  discussed,  one  snould  consider  nypergrapns  as  a 
generalisation  of  structures.  Rather  than 
considering  pin-jointed  Pars,  we  consider 
pin-jointed  metal  sheets  of  an  aroitrary  shape  and 
number  of  pins  (each  metal  sheet  corresponding  to 
an  edge  of  the  hypergrapn  and  each  vertex 
corresponding  to  a  Joint).  To  draw  further  on  the 
analogy,  we  shall  use  the  term  link  as  an 
alternative  to  an  edge;  the  term  Saint  will  be 
employed  as  an  alternative  to  vertex. 


The  numoer  f(H)  =  2n*3m-2d(H)  la  called  the 
internal  freedom  a..  It  is  easy  to  verify  that 
for  a  graph  H  the  definition  of  f(H)  above 
degenerates  to  the  number  of  Internal  degrees  of 
freedom  defined  in  previous  sections. 

The  notion  of  stiffness  can  be  generalized  to 
nypergrapns  as  follows.  A  hypergraph  H  is  said  to 
oe  stiff  iff  it  contains  a  spanning  hypergrapn  H' 
3ucn  that 

1.  f(H'  )  s  0 


We  define  a  welder  to  be  an  operator  that  takes  a 
hypergrapn  and  produces  a  welding  of  it .  An 
incremental  construction  algorithm  is  thus  a 
process  of  successive  applications  of  welders. 

We  have  developed  software  to  represent  and 
manipulate  structures  and  nypergrapns.  Two  types 
of  welders  have  been  implemented  and  some  simple 
construction  algorithms  tried.  We  possess  the 
tools  which  are  necessary  to  develop 
position-locating  algorithms  of  increasing 
sophist ication. 
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lat  d.  denote  tbe  degree  of  the  i-th  vertex  (i.a., 
1  1 

tne  number  of  edges  incident  upon  it).  Let  d^ 

denote  tne  degree  of  the  i-th  edge  (i.e.,  the 
numoer  of  vertices  incident  upon  it).  The  degree 
of  tne  hypergrapn  H  is  defined  to  be  the  number 

n  m  # 

i(H)  s  ^  d(  s^d(,  wnere  m(n)  is  tbe  total  nua- 
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oer  of  edges  (vertices). 


2.  for  any  sudhypergrapb  H"  of  H'f(H")  >0  5 

Let  us  introduce  a  partial  order  over  hypergraphs, 
which  we  call  welding.  A  hypergraph  H’  is  said  to 
oe  a  welding  of  a  nypergraph  if  each  edge  of  H1  is  0 

a  'inion  of  edges  in  H  which  span  a  stiff 
3uohypergrapn  of  H. 

It  is  possible  to  snow  that  stiffness  Is  preserved  7 
under  welding.  Moreover,  each  hypergrapn  possesses 
a  unique  welding  which  is  maximal  (cannot  be  welded 
any  more) .  If  M(H)  designates  the  maximal  walling  . 
of  the  hypergrapn  H,  then  f(M(H))  defines  the 
degrees  of  freedom  of  H.  It  can  be  shown  that 
f ( M( H ) )  >  3  with  equality  iff  H  is  stiff,  in  which 
case  M(H)  has  a  single  edge  covering  all  the  nodes. 

3.2.2  Incremental  construction  algorithms  ^ 

An  Incremental  construction  algorithm  is  a  process 
tnat  starts  with  a  given  positioning  problem  and 
develops  a  solution  by  gradually  increasing  the 
sets  bf  points  whose  relative  locations  are  known. 

At  each  stage,  the  state  bf  the  computation  may  be 
described  as  a  hypergrapn  whose  edges  consist  of 
sets  of  points  whose  relative  positions  are  already 
known.  Such  a  hypergrapn  is  necessarily  a  welding 
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I  INTRODUCTION  AND  OVERVIEW 


A .  Problem  Background 

^  The  key  to  survival  in  a  tactical 
environment  is  the  ability  to  understand,  assess, 
and  anticipate  the  evolving  threat  situation.  This 
perception  of  the  threat  is  a  necessary 
prerequisite  for  formulating  an  appropriate 
response,  such  as  avoidance,  countermeasures,  or 
suppression. 

While  background  information  to  prepare  the 
tactical  commander  for  this  task  is  provided  from 
intelligence  estimates  and  reconnaisance  data, 
real-  or  near-real-time  Information  is  derived 
during  the  actual  engagement  from  electronic 
support  measures  (ESM)  systems.  In  the  past,  an 
ESM  system  (such  as  a  radar  warning  receiver)  could 
identify  threat  systems  from  measured  signal 
parameters  of  weapon-related  radars.  Present  and 
anticipated  threat  systems,  however,  have  evolved 
to  the  point  where  current  S3!  systems  are  no 
longer  adequate. 

In  the  air-defense  environment ,  several 
factors  can  be  identified  that  limit  the  ability  of 
current  ESI  systems  to  cope  with  the  threat 
environment.  These  factors,  which  we  will  mention 
briefly,  provide  driving  forces  to  the  development 
of  an  effective  ESM  system.  First,  air-defense 
(AD)  systems  are  densely  deployed  in  tactical 
areas,  and  can,  by  sheer  numbers,  causa  heavy 
operator  workloads.  Second,  since  there  may  be  a 
very  short  delay  between  the  time  a  threat  system 
emits  a  signal,  and  a  weapon  impacts  on  the  target, 
fast  response  time  is  required.  Third,  many  modern 
AD  systems  are  able  to  use  a  variety  of  sensors  for 
target  acquisition,  tracking,  and  weapon  guidance. 
This  means  that  a  system  to  detact  these  threats 
oust  be  able  to  acquire  a  variety  of  distinct  types 
of  sensor  data  Itself.  Even  worse,  some  systems 
will  never  emit  a  signal,  and  will  need  to  be 
actively  searched  for.  A  final  factor  of  concern 
is  that  most  AD  systems  today  are  highly  mobile, 
and  the  value  of  information  regarding  their 
location  decays  rapidly  with  time. 


This  work  is  being  supported  by  the  Advanced 
Research  Projects  Agency  of  the  Department  of 
Defense,  and  monitored  by  the  Air  Force  Avionics 
Laboratory  Jider  Contract  F336l5-77-C-i250. 


B.  Alternative  Approaches 

There  are  three  main  options  to  consider  for  a 
satisfactory  solution  to  the  problem  of  providing 
near-real-time  threat  deployment  information. 

The  first  possibility  is  a  system  that 
includes  a  human  operator  to  control  and  monitor 
data  from  the  sensor  suite,  and  draw  appropriate 
conclusions .  The  primary  advantage  of  this 
approach  is  that  the  operator  can  interpret  sensor 
data  in  the  context  of  a  wide  variety  of  knowledge 
and  experience,  which  allows  him  to  anticipate 
certain  threats,  and  ignore  those  areas  which  are 
unlikely  to  support  specific  AD  systems.  The 
drawback  is  that  operator  workload  quickly  becomes 
unmanageable. 

A  second  approach  is  to  make  a  semiautomatic 
system  by  tailoring  special-purpose  combinations  of 
sensors  into’ "super-sensors.*  These  specialized, 
hard-wired  combinations  (of  wnich  there  are  only  a 
few  examples)  provide  information  about  a  narrow 
range  of  situations,  out  lack  both  the  3cope  and 
the  flexibility  to  be  of  more  general  use. 

The  third  approach  is  to  build  an  automatic 
system  that  can  operate  a  variety  of  sensors, 
integrate  and  interpret  their  data,  and  have  the 
flexibility  to  adapt  to  changing  situations.  There 
are  (at  least)  two  avenues  that  could  be  pursued 
here,  a  classical  statistical  approach,  and  an 
approach  combining  elements  of  probability  theory 
with  symbolic  inference  mechanisms. 

A  purely  classical  approach  suffers  from 
number  of  drawbacks.  The  rapidly  changing 
characteristics  of  the  threat  situation  makes  it 
unlikely  that  an  adequate  problllty  model  would  be 
available.  If  an  acceptable  model  were  available, 
the  number  of  states  to  be  Included  would  be 
overwhelming. 


C.  An  Autonomous  Multi-Sensor  ESM  System 

The  problem  of  Interpreting  sensor  information 
to  determine  the  threat  situation  is  viewed  as  a 
perception  problem.  Our  approach  to  this  problem 
is  to  interpret  sensor  data  in  the  context  of  a 
prior  model.  The  approach  combines  probabilistic 
techniques  with  inference  mechanisms  in  an  attempt 
to  realize  the  advantages  of  the  three  approaches 
mentioned  above. 

The  problem  separates  naturally  into  three 
parts:  inferring  likely  threats  from  available 
information,  planning  and  executing  a  sensor 
utilization  strategy,  and  integration  of  the  sensor 
data  into  the  situation  model.  A  key  aspect  of  the 
approach  is  the  idea  of  using  prior  Information  to 
anticipate  threats  likely  to  be  encountered.  By 
anticipating  likely  threats,  the  system  achieves 
two  Important  advantages.  First,  the  domain  of 
discourse  is  limited.  That  is,  a  large  number  of 
possible  threat  systems  can  be  reduced  bo  a  smaller 
number  of  plausible  threat  systems  that  actually 
ned  to  be  considered  when  interpreting  sensor  data. 
Second,  by  determining  which  threats  are  likely, 
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the  system  can  use  its  sensors  to  best  advantage  in 
acquiring  relevant  data. 

The  primary  inference  mechanism  is  derived 
from  the  Inference  Net  (INET)  system  developed  at 
3HX[1]  The  INET  is  a  production  rule  system  that 
incorporates  a  modified  Bayesian  approach  for 
updating  the  likelihoods  of  hypotheses. 

The  131  system  is  currently  undergoing 
development.  Several  major  components  have  been 
completed,  while  others  are  expected  to  be  finished 
shortly.  The  next  section  will  discuss  the  system 
in  greater  decail. 


SISTEM  DESCRIPTION 


A.  Overview 

There  are  several  factors  which  make  this 
environment  unique  with  respect  to  others  which 
have  been  addressed  using  either  expert  system 
approaches  or  perception  techniques.  First,  it 
involves  an  intelligent,  hostile  opponent  capable 
of  deception.  Second,  the  large  number  of 
decisions  that  must  be  made  in  a  relatively  short 
length  of  time  means,  a  program  operating  in  this 
environment  must  be  autonomous.  That  is,  it  must 
require  a  minimum  of  human  interaction.  Next,  the 
system  must  be  able  to  deal  vitn  data  from  a 
variety  of  imperfect,  "incommensurate"  sensors, 
integrating  their  outputs  in  order  to  derive  the 
most  plausible  threat  assessments.  Finally,  the 
environment  can  change  rapidly,  and  the  program 
must  be  flexible  enough  that  it  can  be  easily 
modified  to  deal  with  new  threats  and  situations 
not  explicitly  anticipated  when  it  was  written. 

In  this  system,  we  are  attempting  to  compile  a 
threat  Irder  of  Battle  OB)  that  contains  the 
identity  of  all  known  threats  in  an  area  and  their 
locations.  We  first  determine  which  threats  are 
plausible  at  a  specific  location,  based  on  terrain 
factors  which  determine  accessabllity  and  the 
desireabillty  of  the  location  for  situating  an  AD 
system. 

In  addition,  we  use  an  existing  OB  which 
records  locations  of  known  threats.  Rules 
describing  typical  deployments  of  AD  systems  can  be 
used  to  infer  the  presence  (or  absence)  of  one 
threat  given  knowledge  of  another.  For  example,  a 
rule  describing  tbe  fact  that  a  particular 
antiaircraft  artillery  usually  prcvldea  low 
altitude  coverage  for  a  specific  missile  system 
allows  the  program  to  infer  the  presence  of  one, 
given  knowledge  of  the  other. 

A  list  of  anticipated  threats  derived  from  the 
inference  module  is  passed  to  »  sensor  strategy 
planner  (which  is  not  yet  implemented,  but  will  be 
based  on  work  described  in  [2]  which  prioritizes 
them  and  creates  a  plan  for  operating  the  sensors 
so  as  to  optimize  the  utility  of  tne  overall  sensor 
suite.  The  sensor  data  that  results  will  typically 
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provide  ambiguous  identifications,  which  must  then 
be  resolved. 

Disambiguation  involves  three  steps.  The 
first  uses  information  from  sensors  with 
overlapping  coverage.  The  second  uses  geometric 
and  parametric  constraints  specified  by  typical 
deployment  information.  The  third  uses  information 
about  plaslble  threat  locations  derived  in  the 
anticipation  phase  of  the  system.  The  resulting 
threat  identifications  are  integrated  back  into  the 
OB,  and  tbe  process  continued. 

The  remainder  of  this  section  will  describe 
the  main  steps  in  greater  detail. 


3.  Threat  Anticipation 

This  phase  uses  a  rule-based  inference  system 
called  the  Inference  Net  (INET)  to  draw  conclusions 
from  a  variety  of  types  of  prior  Information, 
including  graphic  data  derived  from  maps. 

In  the  INET,  knowledge  is  expressed  as  a  set 
of  Inference  or  implication  rules,  similar  to  a 
logical  implication  of  the  sort, 


This  is  a  logical  inference  rule  which  states  that 
if  statement  E  is  known  to  be  true,  then  H  is  also 
known  true  to  by  implication.  In  a  rule-based 
system,  however,  It  Is  typically  the  case  that 
knowing  E  is  true  does  not  unequivocally  establish 
H,  but  only  allows  the  truthfulness  of  H  to  be 
estimated.  This  necessitates  a  probaoliistic 
inference  approach,  where  rules  are  represented  as 


s  a. 

aere  L  represents  the  "strength"  of  the  rule,  that 
is,  the  degree  to  which  knowing  the  evidence,  S, 
allows  the  system  to  infer  the  hypothesis,  a.  1, 
then,  cen  be  used  to  "update"  the  likelihood  of  H 
given  knowledge  of  E. 

Since  it  is  frequently  the  case  that  mere  than 
one  rule  may  Impinge  on  a  given  hypothesis,  a 
computational  procedure  is  required  to  allow  the 
determination  of  the  combined  effects  of  the 
Individual  pieces  of  evidence.  Typically,  an 
assumption  of  conditional  independence  among  the 
•various  pieces  of  evidence  is  made,  and  then 
relatively  straightforward  manipulation  of  Bayes’ 
rule  allows  computation  of  the  effect  of  the 
evidence . 

H  may  in  turn  be  the  evidence  for  some  other 
node.  In  this  manner,  it  is  possible  to  create  ir. 
interconnected  network  of  rules. 

Several  types  of  knowledge  are  encoded  into 
rules  for  use  by  this  system.  These  include: 
descriptions  cf  composite  threat  systems  e.g.,  an 
3A-6  surface-to-air  missile  system  consists  of  a 
STRAIGHT  FLUSH  radar  and  a  cumber  of  trar.scorter- 
erector-launc.her  veniolas  (TE'-s;);  deployment 


information  (e.g.,  a  23-"®  antiaircraft  gun  usually 
provides  low  altitude  support  for  the  3A-6); 
terrain  influences  Ce.g. ,  5A-6  systems  are  usually 
found  near  roads  --  where  "near"  is  a  function  of 
terrain);  and  the  desirability  of  a  location  Ce.g., 
SA-os  prefer  to  occupy  hilltops). 

A  -/era ion  of  the  INET  program  was  developed 
for  irawlng  inferences  at  each  point  in  a  raster 
overlaid  on  a  designated  area[5]  and  operates  in 
two  passes.  The  first  pass  encodes  the  raster  of 
points  over  the  designated  area  with  information 
including  their  distance  from  items  of  interest, 
such  as  roads,  areas  of  clear  terrain,  or  locations 
wberea  threat  has  previously  been  observed.  These 
distances  are  accessed  during  the  second  phase  of 
the  operation  in  order  to  determine  the  likelihood 
of  evidence  nodes,  such  as  "Point  x  is  near  a 
road."  3y  propagating  such  evidence  through  a 
series  of  implications,  it  is  possible  to  compute 
the  likelihood  of  a  location  being  favorable  to  a 
specific  threat. 

The  output  of  this  program  can  be  displayed  to 
indicate  these  favorable  locations.  The  results  of 
interest  (i.a.,  the  likely  threat  locations)  are 
then  passed  to  the  sensor  strategy  module. 


C.  Sensor  Utilization  Strategies 

The  role  or  this  phase  is  to  examine  the  list 
of  anticipated  threats,  prioritize  them  (usually  by 
probable  lethality),  compare  them  to  models  of  the 
sensors  expected  performance,  select  the 
appropriate  sensors  for  detecting  the  highest 
priority  threats,  and  determine  the  sensor 
parameters  that  need  to  be  preset.  While  the 
module  for  computing  the  sensor  strategies  has  not 
yet  been  implemented,  we  can  outline  its  operation. 
First  we  will  describe  the  sensor  modules  used  by 
the  system. 

It  is  appropriate  to  mention  here  that  this 
effort  assumes  certain  capabilities  on  the  part  of 
the  sensors.  In  particular,  we  do  not  attempt  to 
deal  with  raw  data,  but  instead  assume  that  each 
sensor  has  an  associated  preprocessor  which  enables 
it  to  analyze  its  data,  detect  targets  of  interest, 
and  identify  them  (although  not  perfectly).  The 
lata  returned  from  the  sensors  consists  of  these 
identifications  with  whatever  location  information 
is  appropriate  to  the  sensor. 

Models  of  sensor  performance  are  designed  to 
represent  the  operation  of  the  sensor  with  respect 
to  a  specified  set  of  targets.  The  model  Includes 
prooabllity  arrays  which  specify  the  likelihood  of 
the  sensor  detecting  each  system,  and  the  possible 
identifications  the  sensor  could  assign  with  the 
likelihood  of  each.  These  tables  are  compiled 
assuming  certain  specified  environmental 
conditions.  For  example,  the  probability  tables 
for  an  electrooptical  imaging  sensor  might  nave 
been  compiled  assuming  a  clear  day,  a  certain  range 
to  the  target,  a  certain  focal  Length  for  the  Lens, 
and  so  on. 


In  order  to  adapt  the  models  to  environmental 
conditions,  we  provide  a  set  of  probability 
degradation  factors.  These  factors  are  combined 
and  used  to  modify  the  tables  for  existing 
conditions.  In  this  manner,  the  program  is  able  to 
estimate  the  efficacy  of  a  given  sensor  for 
detecting  and  identifying  a  specific  threat  'under 
nonstandard  conditions. 

Certain  of  these  factors  are  related  to 
controllable  sensor  parameters,  such  as  lens  focal 
length,  or  the  bandwidth  of  a  receiver  channel. 

The  strategy  system  will  attempt  to  select 
appropriate  seta  of  parameters,  and  balance  the 
"cost"  of  using  a  given  sensor  with  its  utility,  in 
order  to  create  a  oost  effective  sensor  strategy. 

Once  sensors  have  been  selected  and  their 
parameters  set,  data  is  collected  and  passed  to  the 
Interpretation  module. 


D.  Sensor  Data  Interpretation 

Since  sensors  are  not  perfect,  the 
identifications  they  render  are  likely  to  be 
ambiguous.  An  Important  job  for  the  overall  system 
is  to  disambiguate,  as  much  as  possible,  the  sensor 
returns. 

The  first  step  in  disambiguation  is  to  compare 
responses  from  sensors  whose  ooveragee  overlap.  3y 
using  the  probability  models  that  describe  the 
sensors,  it  is  possible  to  develop  Identifications 
based  on  the  composite  sens or  information  and,  in 
general,  reduce  ambiguity. 

The  next  step  is  to  use  Information  describing 
typical  geometric  deployments  of  threat  systems  and 
related  systems,  and  their  relative  parameters  to 
further  narrow  the  field  of  possibilities.  This 
phase  uses  geometric  descriptions  of  threat  systems 
(for  example,  an  SA-6  system  consists  of  a 
centrally  located  radar  surrounded  by  four  3A-b 
TELs  in  a  square  array,  100  meters  on  a  side) 
expressed  as  templates.  Using  a  variation  of  a 
linear  embedding  algorithm! n]  the  pattern  matching 
program  attempts  to  find  the  best  matches  of  known 
templates  to  the  sensor  data. 

This  has  two  effects.  It  allows  the  system  to 
use  typicsi  geometric  constraints  to  aid  in  the 
identification  of  threats.  In  addition,  since  it 
is  often  the  case  that  not  all  elements  of  a 
template  are  detected,  it  allows  the  system  to 
specify  not  only  which  elements  are  missing,  but 
where  they  should  be  located. 

The  results  of  this  level  of  interpretation 
are  compared  with  the  favorabillty  lata  computed  by 
the  inference  net,  and  confident  identifications 
entered  into  the  OB  through  the  INET.  This  new 
information  causes  the  system  to  update  its 
likelihoods  and  the  process  is  continued. 


Status 


£. 

This  system  is  currently  under  development. 
Most  elements  have  been  implemented,  with  the 
exception  of  the  strategy  planner.  Models  for 
several  typical  sensors  have  been  developed.  A 
revised  set  of  threat  rules  is  being  incorporated 
into  the  system  at  present.  The  pattern  matcher 
and  the  sensor  integration  have  been  implemented. 
The  next  phase  will  see  the  integration  of  the 
existing  pieces  and  the  development  of  the  planning 
module . 


F.  Extensions 

There  are  several  extensions  and  Improvements 
that  can  be  Identified  at  this  time.  Two  main 
suggestions  concern  the  operation  of  the  sensors  . 
and  the  probabilistic  updating  mechanism  in  the 
11IET. 

The  current  approach  does  not  address  the 
problem  of  analyzing  raw  sensor  ‘data  at  all. 

Rather,  it  assumes  certain  interpretive 
capabilities  on  the  part  of  the  sensor 
preprocessor.  The  Image  Understanding  community 
generally  believes  that  the  more  relevant,  high- 
level  information  available  to  a  sensor-data 
processing  system,  the  better  its  analysis  is 
likely  to  PeC3l  The  current  approach  consolidates 
most  of  the  relevant  information  in  the 
interpretation  routines,  where  it  is  usd  to 
criticize  sensor  performance  in  an  a  posteriori 
fasion,  rather  than  allowing  it  to  be  passed  to  the 
sensor  systems.  In  a  working  system,  with  actual 
sensors,  it  will  probably  be  critical  to  make  this 
information  available  at  a  low  level. 

Furthermore,  some  of  the  most  relevant  data 
may  be  that  which  is  being  collected  simultaneously 
py  other  sensors.  Instead  of  performing  all  data 
integration  at  a  high  level,  as  is  currently  the 
case,  it  appears  more  promising  to  create 
temporary,  direct  communication  paths  between 
individual  sensors. 

The  second  area  where  improvement  could  be 
made  concerns  the  likelihood  updating  computations 
in  the  IMET.  There  are  two  primary  drawbacks  to 
the  current  Bayesian  approach.  The  first  is  that 
assumptions  required  in  order  to  make  the 
computations  tractable  frequently  are  not  valid  la 
our  environment.  In  particular,  the  assumption  of 
conditional  independence  among  evidence  nodes 
impinging  on  a  hypothesis  is  usually  not  valid.  In 
many  cases  of  interest,  situations  are  highly 
interrelated.  For  example,  normal  encoding  of  the 
rules  linking  situations  often  leads  to  loops.  'Jse 
cf  the  independance  assumption  in  these  cases 
provides  incorrect  results.  Our  approach  has  been 
to  perform  inferences  based  on  tightly  coupled 
situations  outside  of  the  INET.  For  example,  the 
pattern  matcher  was  Implemented  after  several 
attempts  to  do  the  matching  in  the  INET.  Another 
approach  would  be  to  extend  the  INET  formalism  to 
allow  weakening  sf  some  of  the  current  INET 
assumptions , 


Furthermore,  the  use  of  point  probabilities 
( which  pervades  the  whole  system)  does  not  allow 
adequate  modelling  of  many  situations.  Instead, 
some  idea  of  the  distributions  involved  are 
required.  In  future  versions  of  the  system,  we 
will  implement  a  scheme  for  using  intervals  as  a 
more  accurate  model  of  actual  distributions. 


Ill  SUMMARY 


Our  approach  to  the  problem  of  integrating 
multisensor  data  with  prior  knowledge  has  several 
key  points  which  should  be  underlined. 

The  characterization  of  the  task  as  a 
perceptual  problem  suggested  many  of  the 
subproblems  that  had  to  be  addressed,  and  pointed 
out  the  need  for  a  common  frame  of  reference  within 
whicn  sensor  data  could  be  integrated  and 
interpreted.  In  this  system,  the  reference  frame 
is  the  Order  of  Battle.  Data  from  sensors  is 
interpreted  in  the  context  of  the  OB.  In  turn  the 
OB  provides  information  about  likely  threats. 

Another  key  point  is  the  use  of  anticipated 
threats  to  both  narrow  the  domain  of  discourse,  and 
to  enable  the  system  to  plan  for  effective  use  of 
sensors.  In  particular,  the  use  of  "extra-signal" 
information  such  as  terrain  data  and  deployment 
rules  to  produce  the  list  of  anticipated  threats 
seems  to  mimic  the  behavior  of  a  trained  and 
briefed  crew  member. 

The  incorporation  of  a  planning  element 
provides  many  capabilities.  It  provides  a 
mechanism  for  effective  use  of  modular  sensor 
descriptions,  which  can  be  added,  deleted,  or 
modified  as  the  system  configuration  is  altered. 

Use  of  a  planner  also  provides  the  flexibility 
needed  to  allow  easy  modification  of  threat  data, 
since  changing  threat  characteristics  will  result 
in  different  sensor  strategies. 

The  modular  sensor  descriptions  could  provide 
the  basis  for  extensions  to  the  system.  Sensors 
are,  in  effect,  modeled  as  knowledge  sources.  It 
appears  likely  that  more  abstract  sources  of 
information  could  be  modeled  in  the  same  manner, 
thereby  allowing  the  system  access  to  ether  types 
of  Information,  such  as  intelligence  estimates. 

The  models  will  also  allow  the  system  to  be  run  in 
a  simulation  mode,  in  order  to  determine  whicn 
characteristics  should  be  improved  to  make  a  sensor 
more  effective.  It  could  even  be  ised  to  decide 
which  tradeoffs  would  provide  the  greatest 
capabilities  in  a  sensor,  before  completing  its 
development. 

A  final  point  is  that  the  up-to-date  IE 
determined  oy  the  system  should  be  easily 
transferraoie  from  one  platform  to  mother.  This 
could  provide  the  capability  for  removing  a  tape 
cassette  from  a  returning  aircraft,  and  inserting 
It  into  one  about  to  depart,  thereby  transferrins 
the  real-time  T8  to  the  outbound  aircraft.  This 
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would  allow  maximum  us  of  the  new  information  while 
it  is  still  timely,  without  requiring  the  lengthy 
debriefing-analysis-briefing  process. 


In  summary,  the  approach  provides  the 
potential  for  an  extremely  robust  5SM  system, 
capable  of  using  a  wide  variety  of  information,  in 
ways  not  possible  in  any  existing  program.  In 
addition,  the  work  suggests  several  possible 
extensions  and  spinoffs  to  related  problem  areas. 
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Language  Design  for  Distributed  Systems 
Peter  G.  Hibbard 


In  this  paper  we  review  some  aspects  of 
language  design  and  language  Implementation 
as  they  relate  to  networks.  In  particular, 
we  examine  how  the  incremental  integration 
of  software  components  must  lead  to  a  re¬ 
assessment  of  the  role  of  the  compiler  and 
the  linker,  and  consider  what  additional 
facilities  might  be  desirable.  We  examine 
the  currenc  paradigms  for  expressing  parall- 
ism  which  have  been  developed  for  closely 
coupled  multiprocessor  systems,  and  explore 
how  well  these  satisfy  the  additional  con¬ 
straints  of  networks,  viz  the  lack  of  common 
shared  aemorv  and  the  relatively  low  communi¬ 
cation  bandwidth.  We  also  consider  briefly 
the  run-time  organizations  which  will  be 
needed  to  support  these  facilities. 

1.  Compilation 

'.’a  assume  ,  that  any  particular  task  which 
is  being  executed  by  a  network  will  comprise 
several  relatively  independent  processes 
executing  on  possibly  different  hosts.  During 
the  lifetime  of  the  taak  individual  sections 
mav  need  to  be  rewritten  or  reolacad,  or 
additional  sections  may  need  to  be  added,  and 
ie  may  be  necessary  to  perform  these  modifi¬ 
cations  while  the  rest  of  the  sections  continue 
to  ooerate.  Traditionallv  such  modifications 
would  be  done  by  employing  some  form  of  sep¬ 
arate  compilation,  followed  by  a  linkage 
editing  ohasa  exerting  oniv  weak  checking  on 
the  excermai  soecif ications  of  the  components. 
However,  we  believe  that  any  interaction  which 
takes  place  between  sections  of  program  should 
be  subject  to  the  usual  checks  provided  by 
compilation  -  these  checks  snouid  ac  least  in¬ 
clude  cype  checking  of  messages  passed  between 
processes.  We  are  currently  exploring  che 
oossibilit"  of  checking  processes'  specif lca- 
cions  of  cheir  patterns  of  use  of  resources 
against  s'-scem-vide  constraints. 

1.  Specifying  parallelism 

Considerable  progress  has  been  made  in 
devising  and  assessing  che  techniques  required 
to  coordinate  pseudo-parallel  and  parallel 
tasks.  However,  the  diversity  of  che  casks 
which  It  Is  required  to  coordinate  is  30  greet 
that  chev  cannot  all  be  programed  with  1  single 
tecnnique.  In  part  this  oroblem  arises  because 
we  lack  sufficient  experience  In  programming 
parallel  algorithms,  and  in  part  it  arises 
because  the  current  programming  control  con¬ 
structs,  onto  which  che  parallel  constructs 
are  grafted,  overspeclfv  the  sequentially  in 
some  cases,  and  do  not  provide  any  means  for 
expressing  parallelism  in  ocher  constructs. 
rorcunacelv,  when  dealing  with  networks  the 
restricted  environment  causes  schemes  based  on 


message  passing  to  be  attractive,  and  a 
number  of  proposals  have  recently  been 
based  upon  these  primitives  (Hoare,  3rincn 
Hansen,  Hewitt).  We  examine  one  such  tech¬ 
nique  (eventual  values)  In  detail,  since  it 
provides  a  high  level  model  for  interprocess 
which  closely  relates  to  the  manipulation 
of  values  In  algebraic  languages,  and  dis¬ 
guises  the  details  of  camunlcation  and 
remote  computation  in  3uch  a  way  as  Co  allow 
specification  of  programs  which  are  reasonably 
Independent  of  Che  network  configuration. 

3.  Huntime  organizations 

There  will  be  several  different  pat¬ 
terns  of  interaction  between  che  processes 
which  comprise  a  task.  In  the  pactern  of 
interaction  which  comprises  a  server  process 
serving  several  client  processes.  It  Is 
possible  for  an  Interaction  to  take  olace  by 
only  one  round  of  messages  passing  between 
them.  In  this  caae  the  server  need  keep  no 
record  of  internal  state,  and  it  may  release 
any  resources  aqulred  for  the  interaction. 
However,  it  is  possible  for  the  Interaction 
co  be  a  transaction  which  comprises  several 
messages,  and  che  server  process  must  main¬ 
tain  some  internal  state,  and  retain  access 
to  resources  until  the  transaction  is  com¬ 
plete.  In  the  case  where  the  completion  does 
not  occur,  because  either  the  server  or  the 
client  has  catastrophically  terminated,  it 
will  be  necessary  to  restore  che  resources 
to  a  consistent  state,  or  co  restore  them 
back  to  che  3tate  which  pertained  ac  che 
start  of  the  transection.  Techniques  for 
handling  these  problems,  ac  least  in  simple 
cases,  have  been  developed  foe  distributed 
daca  bases  and  file  systems.  Essentially  chev 
rely  on  simulating  the  effects  of  the  oper¬ 
ations  requested  bv  the  client,  without 
destructive  change  to  the  resource.  This 
only  cakes  place  when  the  client  signals 
that  all  interactions  in  chac  transaction 
are  complete.  During  the  period  when  the 
resource  has  not  teen  uodacad,  it  is  essen¬ 
tial  that  che  server  responds  to  che  client' 3 
requests  for  actions  on  che  resource  as 
though  che  update  had  occurred.  In  che  case 
of  file  updating  and  reading,  this  is  rela¬ 
tively  simple,  however  investigations  still 
need  co  be  made  for  more  complex  cases  to 
ensure  chac  che  simulated  operations  have  a 
bounded  time  cost,  irrespective  of  the  orior 
operations  performed  on  the  resource,  "e 
resort  on  experiences  gained  in  developing 
bounded-cost  pseudo-operations  in  a  related 
programing  environment. 
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to  Kalingrad  to  Odessa  to  3aku.  This 
barrier  is  5500  Km  long,  and  may  be  said 
to  protect  almost  all  of  European  Russia. 
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SUMMARY 

First  order  considerations  of 
■:o=3ib-.‘  DSN  system  deployments  for 
strategic  defense  against  a  cruise 
missile  attack  are  presented.  The 
deployments  are  discussed  in  terms  of 
their  geometrical  coverage  and  their 
vulnerability  to  offense  concentration. 

Perimeter  defense,  ring  defense,  and 
random  area  defense  are  considered  in  the 
context  of  a  defense  of  the  Soviet  Union. 

Assumptions 

It  is  assumed  that  a  distributed 
sensor  network  is  available  for  deploy¬ 
ment  which  works  in  the  sense  that  it 
provides  the  requisite  tracking  data  to 
seme  undefined  interceptor  system  in  a 
way  that  does  not  limit  overall  system 
erformance.  It  is  assumed  that  the 
raise  missile  attack  is  massive 
'.thousands  cf  missiles)  and  responsive 
to  the  defense  deployment,  but  that  the 
PEN  elements  are  no  more  susceptible  to 
'.irect  attack  than  are  other  defense 
system  components. 

Perimeter  Defense 

The  total  perimeter  of  the  Soviet 
Union  is  over  65,000  Km,  however  most  of 
the  Soviet  economic  structure  is  enclosed 
by  a  perimeter  defined  by  the  line  con¬ 
necting  Leningrad  with  Novosibirsk,  the 
line  connecting  Novosibirsk  with  Rostov- 
Na-Danv  and  the  western  border  between 
Postov-Na-Danv  and  Leningrad.  This  area, 
shaped  like  an  ice  cream  cone,  has  a 
perimeter  of  approximately  9200  Km. :  A 
defensive  barrier  against  cruise  missile 
attack  need  net  be  a  closed  perimeter.  If 
the  defense  takes  advantage  of  knowledge 
of  cruise  missile  range  and  possible  re¬ 
lease  areas,  then  a  shorter  and  perhaps 
more  effective  barrier  might  be  deployed 
as  follows:  frt*n  Vorkuta  (app.67 CN,  643E) 


The  number  of  sensors  required  to 
implement  this  5500  Km  barrier  depends 
on  the  spacing  and  on  the  depth  of  the 
barrier  and  will  not  be  further  discussed 
here.  The  number  of  weapon  stations 
(3AM  batteries  or  manned  interceptor 
locations  required)  depends  on  their 
effective  width.  For  SAM  battery  effect¬ 
ive  widths  between  5.5  and  55  Km,  1000  to 
100  batteries  will  be  required.  The  depth 
of  sensor  net  required  to  support  these 
batteries  will  depend  on  the  details  of 
system  operation,  but  will  be  approxi¬ 
mately  equal  to  the  battery  spacing. 
Fighter  interceptors  must  be  stationed 
so  that  they  can  reach  the  cruise  missile 
before  it  leaves  the  coverage  of  the 
sensor  net.  Their  spacing  may  be  calcu¬ 
lated  from  the  cruise  missile  velocity, 
the  fighter  interceptor  average  velocity, 
and  the  DSN  depth.  For  example,  if  the 
cruise  missile  velocity  is  equal  to  the 
fighter-interceptor  average  velocity  and 
the  depth  is  50  Km,  then  the  fighters  may 
be  stationed  100  Km  apart. 

Perimeter  defenses  suffer  a  serious 
deficiency,  in  that  the  offense  may  con¬ 
centrate  his  attack  into  a  few  corridors 
and  by  means  of  local  exhaustion,  satura¬ 
tion,  or  defense  suppression  penetrate 
the  barrier  at  modest  cost.  The  attackers 
are  then  free  to  fan  out  from  the  pene¬ 
tration  corridors  and  attack  where  they 
will.  If  the  offense  requires  c  corridors 
of  width  ca  Km  per  unit  length,  then  we 
may  define  an  offense  leverage,  L,  given 
by 

L  =  (c  w)'1. 

The  reciprocal  of  L  is  the  percentage  of 
the  perimeter  resource  that  can  be  brought 
to  bear  on  the  concentrated  attack.  If 
the  offense  requires  two  corridors  per 
1000  Km  of  barrier  and  a  corridor  width 
of  20  Km,  then  L  is  25  and  only  4%  of  the 
defense  weaponry  is  usable.  To  achieve 
this  leverage,  the  offense  incurs  a  modest 
range  penalty  and  the  losses  associated 
with  exhausting  or  destroying  the  defense 
resources  in  the  corridors. 


The  views  and  conclusions  contained  in  this  document  are  those  of  the  contractor  and 
should  not  be  interpreted  as  necessarily  representing  the  official  policies,  either 
expressed  or  implied,  of  the  United  States  Government. 
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Rina  Defenses 


The  defense  can  reduce  the  offense 
leverage  by  arranging  the  perimeter  defense 
into  rings  which  enclose  the  value  to  be 
defended.  If,  for  example,  rings  of  100 
Km  perimeter  (16  Km  radius)  are  chosen 
then  the  offense  leverage  is  limited  to 
100  divided  by  u.  Thus  a  20  Km  corridor 
offers  an  offense  leverage  of  5. 


in 


The  resources  to  be  defended  will  not 
in  general  be  found  to  be  conveniently 
clustered  for  the  purposes  of  defense, 
the  absence  of  detailed  information  on 
target  clusters  or  value  distributions, 
Zipf's  law2  may  be  used  to  approximate 
the  distribution.  Thus  if  the  rings  are 
ordered  by  decreasing  enclosed  value,  then 
the  value  in  the  ith  ring  will  be  pro- 

_  t 

portional  to  (i)  By  arbitrarily  assign¬ 

ing  a  value  of  1300  to  the  first  ring,  the 
following  table  may  be  constructed: 


Value 
.  th 
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Value  in 
All  Rings  Up  To 


where  d  is  the  density  of  sites 
b  is  the  effective  battery  widt 
is  the  distance  transverse!  by 
missile,  and  p  is  the  probabili 
encountering  a  defense  battery 
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10  Km,  and  d  is  10  '  then  x  is 
is  about  37%  of  the  cruise  miss 
survive  a  1000  Km  trip  through 
containing  1  defense  battery  pe 


1000.  That 
iles  will 
an  area 
r  10,300 


Km  ,  each  battery  having  a  batt 
of  10  Km. 


ery  width 


If  the  locations  are  known  to  the 
offense,  then  rerouting  can  degrade  the 
defense  performance.  If  the  presence  of 
a  random  area  defense  is  known,  but  the 
specific  locations  are  not,  the  offense 
may  still  improve  its  position  by  routing 
the  cruise  missiles  in  a  systematic  way 
in  an  attempt  to  locally  exhaust  portions 
of  the  defense. 
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It  is  apparent  that  the  laws  of  social 
agglomeration  are  working  against  the 
defense  designer  if  he  tries  to  defend 
a  sizable  fraction  cf  the  nation's 
resources  with  small  rings. 

A  defense  based  on  uniformly  defended 
rings  'will  be  seriously  unbalanced.  For 
example  if  the  first  10  rings  are  defended 
by  13  batteries  each  (effective  battery 
width  of  10  Km) ,  then  the  tenth  ring  is 
defended  at  1C  x  the  lavel  on  a  per  unit 
value  basis  as  is  the  first  ring.  To 
produce  a  balanced  ring  defense,  the 
defense  must  equip  the  rings  in  proportion 
to  the  value  enclosed.  If  the  enclosed 
value  does  not  justify  enough  equipment 
to  fully  man  each  ring,  then  a  balanced 
defense  of  that  many  rings  at  that  pro¬ 
portion  of  defense  cost  to  defended  value 


is  not 

possible. 

Random 

Area  Defense 

Th 

e  defense  mav 

■  deploy  defense 

hatter- 

ss  at  random 

throughout  the  region 

containing  the  targets  (for  the  case  of 
the  Western  USSR  -  about  3.4  x  13 6  sq.Kn). 
If  the  offense  does  not  know  the  location 
or  i  mores  the  presence  of  these  randomly 
deployed  batteries,  then  we  car.  model  the 
encounter  process  as  random: 


•"Area  Handbook  for  the  Soviet  Union,” 

DA  PAM  550-95  contains  a  good  introduc¬ 
tion  to  Soviet  geography. 

2Cipf's  law  estimates  the  size  of  each 
member  of  an  ordered  set  of  social 
.  agglomerations  as : 


where  i  is  the  rank  order  and  3;  is  the 
size  of  the  i member.  This  is  dis¬ 
cussed  further  in  "System  engineering" 
by  Goode  &  Macho!,  McGraw  Hill  3cok  Co., 
1957. 


